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ABSTRACT 

We present a comprehensive study of the physical properties of ~ 10 5 galaxies with mea- 
surable star formation in the SDSS. By comparing physical information extracted from the 
emission lines with continuum properties, we build up a picture of the nature of star-forming 
galaxies at z < 0.2. We develop a method for aperture correction using resolved imaging and 
show that our method takes out essentially all aperture bias in the star formation rate (SFR) 
estimates, allowing an accurate estimate of the total SFRs in galaxies. We determine the SFR 
density to be 1.915tj^(rand.)12;^(sys.) h 70 10~ 2 M /yr/Mpc 3 at z = 0.1 (for a Kroupa 
IMF) and we study the distribution of star formation as a function of various physical param- 
eters. The majority of the star formation in the low redshift universe takes place in moderately 
massive galaxies (1O 1O -1O 11 M ), typically in high surface brightness disk galaxies. Roughly 
15% of all star formation takes place in galaxies that show some sign of an active nucleus. 
About 20% occurs in starburst galaxies. By focusing on the star formation rate per unit mass 
we show that the present to past-average star formation rate, the Scalo 6-parameter; is almost 
constant over almost three orders of magnitude in mass, declining only at M* > 10 10 Mq. The 



volume averag ed b parameter is O^OS+^^^randO+^g^sysOhfo 1 . We use this value con- 
strain the star formation history of the universe. For the concordance cosmology the present 
day universe is forming stars at least 1/3 of its past average rate. For an exponentially declin- 
ing cosmic star formation history this corresponds to a time-scale of 7^' 5 Gyr. In agreement 
with other work we find a correlation between b and morphological type, as well as a tight 
correlation between the 4000A break (D4000) and b. We discuss how D4000 can be used to 
estimate b parameters for high redshift galaxies. 



1 INTRODUCTION 



One of the most active areas of research during the last decade 
has been the study of the present and past star formation histo- 
ries of galaxies. This includes detailed stu dies of the stel lar pop- 
ulation of our own Galaxy (e.g. Rocha-Pinto et al. 2000) and of 
Local Group dwarfs, both individually and as relic evidence of 
the global star formation h istory of the Universe iHernandez et alJ 
120001 IHopkins et ai]|200ll) . While potentially very powerful, the 
techniques used in these studies are difficult to extend much be- 
yond the Local Group. 

At somewhat larger distances, efforts have gone towards 
understanding the global properties of more massive galaxies 
and the dependence of st ar formation on mass and/or m orpho- 
logical classification (e.g . |Kennicutt||l983t iKennicutt et aljfl994t 
iGavazzi & Scodeggioll99rJ : lBoselri et all2001fl . This has been ac- 
companied by careful mapping of b oth the star formation and 
the gas distribution in galaxies (e.g. iMartin & Kennicut3l200ll : 



IWong & Blitzj|2002t lYoung et alJll995t) . Based on these studies, 
a picture has emerged in which more massive galaxies undergo a 
larger fraction of their star formation at early times than less mas- 
sive ones. This has often been said to present a challenge for exist- 
ing models of galaxy formation. 

These studies have traditionally been carried out with rela- 
tively small samples of local galaxies . For e xample, the compar- 
atively large sample of IGavazzi et alJ 120021) contains 369 spiral 
galaxies. With the advent of large fiber-fed surveys such as the 2dF 
<Collessetalj200ll) and the SDSS <York et all200(ih . it is now pos- 
sible to extend such studies dramatically in size. Larg e galaxy sur - 
veys have most recently been exploited by IHopkins et alJ 120031) . 
who carried out a comprehensive study of different SFR indicators, 
and by Nakamura et al. 1 2003) who studied the local Ha luminosity 
density. 

Alongside these careful studies of individual local galax- 
ies, a global picture of the star formation history of the uni- 
verse has been assembled through careful studies of star form- 
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ing galaxies as a function of look - back time j Cowj 

Song aila et alJll990l: Lilly et alJll996t IMadau et alJll99L 

l200lHMadauetal.lll998h . The main emphasis of this work has 
been the star formation history of the universe as a whole. Some 
workers have also tried to study the evolution of different m orpho- 
logical types l Brinchmann et al . 1998: lGuzman et alll997t) . These 
studies have adopted a wide variety of star formation ind icators 
jGlazebrook et alJll999j:ISullivan et alJl200ltlHopkms et alJl200lt; 
iGlazebrook et al.l2003l : lAfonso et alJ200(tlFlores et alll999n . De- 
spite differences between different indicators it is clear that there 
has been a decline in the rate of star formation since z ~ 1, al- 
though there is still no consensus on how rapid this decline was. 
There is also no clear consensus on the star formation history be- 
fore z = 2. Studies in the li terature give an swers ranging from a 
clear increase back to z = 8 I Lanzet ta et all2002|). through a con- 
stant rate of star forma tion back to z = 5 (e.g. lBaldry et alj 2002|: 
Giava lisco et alJ l2003). to a decline at high z (e.g. Ma dau et alj 

One of the key unanswered questions is what physical pa- 
rameters) drive changes in the star formation rate in individual 
galaxies. Although an increasing body of evidence points to turbu- 
lence being a key influenc e on star formation on small scales (e.g. 
iMacLow & Klesser]|2003l and references therein), the origin of 
this turbulence and its relevance for star formation on galaxy-sized 
scales is unclear. It is therefore of major interest to investigate em- 
pirically what global quantities correlate strongly with star forma- 
tion activity. It is logical, however, to expect that the star formation 
rate is a function of many variables. To investigate this dependence 
it is necessary to study the star formation rate densit y function i n 
various projections (see also the approach bv lLanzetta et all2002l) . 
This is also becoming a favoured approach to analyse theoretical 
models (e.g. lSpringel & Hernauisi2003tlYang et all2003l) . 

We will pursue this goal here by trying to tie together detailed 
information on individual galaxies with their contribution to the 
overall cosmic star formation activity. This is possible both through 
studying a snapshot in time and by comparing current and past av- 
erage star formation distributions. On a global scale, this provides a 
nearly model-independent complemen t to the studies of the c osmic 
spectrum bv lBaldrv et all J2002h and lGlazebrook et alJ <2003h . 

This kind of study requires a survey that spans a large range in 
galaxy properties and with accurate photometric and spectroscopic 
inform ation. The Sloan Digital Sky Survey (SDSS lYork et alJ 
2000) is an ideal candidate for this kind of work. Its accurate pho- 
tometry allows good characterisation of the selection function and 
the structural parameters of galaxies. The large database of accu- 
rately calibrated spectra allows a much more careful study of the 
spectral properties of galaxies than has hitherto been possible. 

With such unprecedented quality of data it is pertinent to im- 
prove modelling over what has generally been done before. One as- 
pect in particular that is generally missing in earlier work is a rigor- 
ous treatment of line formation and dust. This is especially impor- 
tant for the SDSS as it spans a large range in luminosity so the phys- 
ical parameters of the galaxy can be reasonably expected to vary. 
In this paper we have therefore built on the work of Chari ot et alJ 
(2002) and fit all strong emission lines simultaneously to simulta- 
neously constrain these physical properties. 

Our paper is structured as follows: We start in section [2] by 
briefly describing the data used. Section[3]gives an overview of the 
models and the modelling needed for our analysis. This includes 
a discussion of the relationship between Ha-luminosity and SFR. 
Section|4]discusses in some detail the influence of AGN activity on 
our estimates of star formation rates (SFRs). Section|5]discusses the 



very important question of aperture effects in a fibre survey such as 
the SDSS and sets out our method for removing these. With these 
preliminaries out of the way, we discuss the local SFR density in 
Section[6]and give an inventory of star formation in Section|7| The 
specific star formation rate and its variation with mass is discussed 
in section[S|and the variation with other physical parameters is cov- 
ered in Section [5] The paper is concluded in Section fTol We use 
a cosmology with Ho — 70km/s/Mpc, Am = 0.3, Q,a ~ 0.7. 
Where appropriate we define h = f/o/100km/s/Mpc. We use the 
Kroupa 1 2001 ) universal IMF. To convert to the popular choice of a 



Salpeten l 19551) IMF between 0. 1 and IOOMq , one should multiply 



our SFR estimates by 1.5, the ratio of mass in the two IMFs for the 
same amount of ionising radiation. We will also make use of stellar 
mass estimates below and to convert these to a Salpeter IMF it is 
again necessary to multiply the masses by 1.5. 



2 THE DATA USED 



The Sloan Digital S ky Survey (SDSS) jYork et alJ fcOOOh : 
Stoughton et al 1 120021) ) is an ambitious survey to obtain spectro- 
scopic and photometric data across n steradians. The survey is con- 
ducted using a dedicated 2.5m telescope at Apache Point Observa- 
tory. The photometry is obtained using drift- scanning with a unique 
CCD camera iGunn et alll9 981. allowing near- simultaneous pho - 
tometry in five b ands (it, q, r, i, z, iFukueita et al] 119961) : 
iHogg et alJ l200lh : ISrnith et all i20o3) 1. The resultin g data are re- 
duced in a dedicated photometric pipeline, photo iLupton et alJ 
2001) and astrometrically calibrated l Pier et al. 2003). We use the 
results of photo v5.4. To estimate total magnitudes we use 
the cmodel magnitud es recommended for use by the DR2 Paper 
I Abaz aiian et all2004) . This is a weighted combination of the ex- 
ponential and de Vaucouleurs fits to the light profile provided by 
photo and has the advantage over Petrosian magnitudes that they 
are higher S/N (cf. the discussion in lBaldrv et aljl2003h . We have 
also performed all calculations with the Petrosian total magnitudes 
and found that the results in the paper change if these are used al- 
though we get a few more outliers. Fiber magnitudes in g, r, i are 
measured directly from the observed spectrum and the r-band mag- 
nitude is normalised to the fiber magnitude of photo . When es- 
timating fixed-frame magnitudes we follow iBlanton et all (2003b ) 
and k-correct our magnitudes to z — 0.1 to minimise the errors on 
the k-correction. We will also adopt their notation and refer to these 
redshifted magnitudes as 0A g, 01 rand 0,1 i. 

The spectroscopic observations are obtained using two fiber- 
fed spectrographs on the same telescope with the fiber place- 
ment done using an efficient tiling algorithm I Bl anton et all2003T) . 
The spectroscopic data are reduced in two independent pipelines 
spectrold (SubbaRao et al, in preparation) and specBS 
(Schlegel et al, in preparation) — we use the specBS redshifts, 
but the differences between the two pipelines are entirely negligi- 
ble. 

The dataset used in the present stud y is based o n the main 
SDSS ga laxy sample, described bv lStrauss et alJ 120021) . We use a 
subset of Blanto net alJ l200 3b)'s Sample 10 consisting of 149660 
galaxies with spectroscopic observations, 14.5 < r < 17.77 and 
0.005 < z < 0.22. This encompasses slightly more galaxies than 



1 For those galaxies where we cannot measure fibre magnitudes from the 
spectra we adopt the fibre magnitudes from photo. This has no effect on 
the results in this paper. 
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the SDSS Data Release 1 jAbazaiian et all2003l) . The overall con- 
tinuum properties of this s ample are discussed in some detail by 
Kauf fmann et alJ <2003ibh and the sample selection and spectrum 
pipeline analysis are discussed by Tremonti et al. 1 2003). The spec- 
trophotometric calibration we use will also be used in Data Release 
2 jAbazaiian et all2004l) . We make extensive use of s tellar masses 
for the galaxies taken from Kauffmann ct al. 1 2003a). To be con- 
sistent with the latter we will throughout use th e narrow definition 
of the 4000A-break from Bal ogh et all ll998t) and denote this as 
D4000. 

As detailed in iTremonti et alJ 1200 3l) we have chosen to re- 
analyse the ID spectra using our own optimised pipeline. This al- 
lows us to take more care in the extraction of emission line fluxes 
than is done in the general purpose SDSS pipeline. The key differ- 
ences for the work presented here are as follows: we remove the 
smear exposures so that the spectra are uniformly exposed within 
a 3" fibre 2 and we use the latest high-res olution population syn- 
thesis models by Bruzual & Chariot 1 2003) to fit the continuum us- 
ing a non-negative linear-least- squares routine lLawson & H anson 
1974). This provides an excellent fit to the continuum and lets us 
perform continuum subtraction with unprecedented accuracy. The 
fitting procedure automatically accounts for weak metal absorp- 
tion under the forbidden lines and for Balmer absorption (see e.g. 
iBruzual & Chariot! [2003 ) for examples of the quality of fit). 

The ability to extract very weak line emission turns out to be 
very important when studying trends with mass (and hence increas- 
ing continuum contribution) and when estimating dust attenuation. 
We will see below that it is also important to identify possible AGN 
contamination of the emission lines. 

A crucial aspect of the SDSS for studies of the nearby galaxy 
population is that it has a w ell-defined and well-studied selection 
function IStrauss et alJl20o3) and covers a large range in absolute 
magnitude. This means that we can use the galaxy sample not only 
to study individual galaxies, but to construct distribution functions 
for volume-limited samples with high accuracy. Note that one can 
usually extract trends from a judiciously chosen, but statistically 
poorly characterised sample, but the distribution of a parameter re- 
quires a statistically well-defined sample. We will therefore focus 
our attention on distribution functions throughout our analysis. 

We will limit our study to galaxies with 0.005 < z < 0.22. 
The lower limit reflects our wish to include galaxies of the lowest 
possible luminosity. At the same time, we wish to avoid redshifts 
where deviations from Hubble flow are substantial and the acces- 
sible volume is very small. At the low redshift limit, a galaxy with 
the mass-to-light ratio appropriate for a 13.6 Gyr old stellar pop- 
ulation and r = 17.77 would have a stellar mass just below 10 8 
solar masses. We should therefore be complete to this mass limit 3 . 
Our sample can thus be used to reconstruct the properties of a vol- 
ume limited sample of galaxies with M* > 10 s M®. As we will 
see below, we also include the vast majority of all star formation in 
the nearby Universe. 

2.1 Sample definitions 

We will attempt to treat the entire sample in a uniform way in our 
SFR calculations. It will, however, prove necessary in the follow- 
ing discussion to define several subsamples of objects based on 

2 This is the default procedure in DR2. 

3 Dust attenuation should be of minor importance in this estimate given the 
observed low attenuation in old stellar systems. 
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Figure 1. The distribution of the galaxies in our sample in the BPT line- 
ratio diagram. The two lines shows the division of our sample into the three 
subsamples discussed in the text. An unweighted version of this diagram 
can be seen in Figure[9] The galaxies plotted here have S/N> 3 in all four 
lines. 



Subsample 


Number 


Percent 


Percent of mass density 


All 


146994 


100.0% 


100.0% 


SF 


39141 


26.6% 


20.6% 


C 


14372 


9.8% 


11.3% 


AGN 


8836 


6.0% 


7.1% 


Low S/N LINER 


11752 


8.0% 


9.5% 


Low S/N SF 


29115 


19.8% 


20.0% 


UnClass 


43778 


29.8% 


31.6% 



Table 1. Basic data about the subsamples discussed in the text. 



the ir emission line prop erties. These are defined on the basis of 
the Baldwin et al. 1 198 1 , BPT) diagram shown in Figure □ The di- 
agram has been divided into three regions which we discuss further 
below. 

Although a classification can be made on the basis of FigureQ 
regardless of the signal-to-noise (S/N) in the lines, we find that re- 
quiring S/N> 3 for all lines is useful. Below this limit, a rapidly 
increasing fraction of galaxies have negative measured line fluxes 
and the non-symmetric distribution of galaxies along the y-axis of 
the BPT diagram leads to classification biases. We have adopted 
the following subsamples (cf. Table0: 

All The set of all galaxies in the sample regardless of the S/N of 
their emission lines. 

SF The star forming galaxies. These are the galaxies with S/N> 3 
in all four BPT lines that lie below our most conservative AGN 
rejection criterion. As discussed in section|4| they are expected to 
have very low (< 1%) contribution to Ha from AGN. 

C The objects with S/N> 3 in all four BPT lines that are between 
the upper and lower lines in FigureQ We refer to these as composite 
galaxies. Up to 40% of their Ho-luminosity might come from an 
AGN. The lower line is taken from Kauffmann et al. ( 20033) and 
is a shifted version of the upper line. It does lie very close to an 
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empricial determination of the SF class (see section |4| below) but 
we have kept the Kauffmann et al line to be consistent with that 
work. No results below are affected by this choice. 

AGN The AGN population consists of the galaxies above the up- 
per line in Figure Q This line corresponds to the theoreical upper 
limit for pure starburst models so that a substantial AGN contribu- 
tion to the line fluxes is required to move a gala xy above this line . 
The line has been taken from Equation 5 in Kewl ev et alJ<2001bh . 
but our models have an identical upper limit. 

Low S/N AGNs A minimum classification for AGN galaxies is 
that they have [N II16584/Hq > 0.6 (and S/N> 3 in both lines) 
(e.g. Kauffmann et al. 2003c). It is therefore possible to classify 
these even if [O III]5007 and/or H/3 have too low S/N to be usefu l 
(cf. Figure [5). A similar approach is taken by Miller etall 1200 3l) . 
In general, we will include these galaxies together with the AGN 
class. 

Low S/N SF After we have separated out the AGN, the compos- 
ites and the low S/N AGNs, we have thrown out most galaxies with 
a possible AGN contribution to their spectra. The remaining galax- 
ies with S/N> 2 in Ha are considered low S/N star formers. We can 
still estimate the SFR of these galaxies from their line strengths, 
even though we cannot use the full modelling apparatus described 
in the following section. 

Unclassifiable Those remaining galaxies that are impossible to 
classify using the BPT diagram. This class is mostly made up of 
galaxies with no or very weak emission lines. 

Note that this is merely a "nuclear" classification — it does 
not make any statement about the properties of the parts of the 
galaxy outside the region sampled by the fibre. In particular, we 
expect the Unclassifiable category to include a substantial number 
of galaxies where we only sample the central bulge — for such 
galaxies there may be considerable amounts of star formation out- 
side the fibre. We will return to this point when discussing aperture 
corrections below. 

Figure|2|shows the distribution of log S/N in each line consid- 
ered for the galaxies in our sample. All galaxies falling outside the 
plotted range have been included in the three bins at the extremes 
of the plot. The thick solid line shows the overall S/N distribution 
and the distributions for each of the different classes defined above 
are shown with different line styles as indicated by the legend in 
the first panel. For clarity, the low S/N AGNs have been grouped 
with the AGN. 

These distributions are of considerable interest to understand 
possible biases in our classification scheme. The most notable re- 
sults from these panels can be summarised as follows 

• The SF & C classes are limited by the strength of the [O 
IIl]5007-line (see the sharp edge for the thick dashed line at S/N=3 
in the [O IIl]5007-panel). This is because the low S/N SF&C galax- 
ies tend to be in the bottom right of the BPT diagram where [O 
IIl]5007<H/3. 

• Since the AGN class has [O IIl]5007>H/3 the main limitation 
for the AGN class turns out to be the S/N in H/3. This is why using 
a AGN definition that only depends on Ha and [N II]6584 can help 
us identify a larger number of AGN. 

• There is clear evidence that there is a considerable number of 
SF galaxies that are thrown out because they have low S/N in [O 
III]5007. The inclusion of the low S/N SF class is therefore very 
beneficial. As Table Q shows this almost doubles the number of 
galaxies for which we can use Ha to determine the SFR. 

Our sample covers a large range in galaxy luminosity and the 
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Figure 3. The median signal-to-noise ratio for five of our lines that can be 
observed over the entire redshift range spanned by our sample. Notice in 
particular how [O III]5007 goes from being very strong at low masses to 
being the weakest line at high masses. 

distributions shown in Figure [2] suppress this dimension. A com- 
plementary view of these data is shown in Figure|5] This shows the 
trend in the median S/N of the different lines as a function of the 
stellar mass of the galaxies. We have suppressed the results for the 
[O II] 3727 line as it falls out of the sample at low redshift and also 
lies in a region of the spectrum that has rather different noise char- 
acteristics to that where the other lines reside. It shows a similar 
trend to to [O III]5007. 

The most important point in this figure is that the fraction of 
galaxies for which we can assign a nuclear classification is a de- 
clining function of mass. With any fixed S/N cut, the unclassifiable 
category will be biased towards more massive galaxies. 

The decline in S/N in the lines with mass is seen at M* > 
lO lo A/0. For lower masses, we note that the S/N in Ha is constant 
over a large range in mass. We will return to this point later. [O 
III]5007 changes from being the strongest line at low mass to being 
the weakest at high mass. This is caused by the decreasing electron 
temperature. At high mass, most of the cooling takes place in mid- 
IR fine-structure lines, depressing the [O IIl]5007-flux strongly. At 
the very highest masses the trend reverses beca use of the higher 
fractio n of AGN in the most massive galaxies (c.f. lKauffmann et"all 
l2003ch . The [N II]6584 and [S II]6716 lines increase in strength as 
a function of mass because of the increasing metallicity (see the 
discussion in CL01). At high masses, the [N Il]6584-line is similar 
in strength to the Ha-line, consistent with an increasing fraction of 
LINERs at high masses (c.f. lKauffmann et all2003ch . 



3 MODELLING 

Our aim is to use the data presented in the previous section to con- 
strain the physical properties of the galaxies in our sample. The 
main emphasis in this paper is on the SFR. In the following we 
will sometimes distinguish between SFRs derived directly from the 
emission lines as discussed in this section and those derived indi- 
rectly from the 4000A-break as discussed in section |4~TI The for- 
mer we will refer to as SFR C and the latter as SFRd. In the major- 
ity of the paper we will use SFR C for some galaxies and SFRd for 
others and in that case we will use SFR for simplicity, the details 
will be given towards the end of section|5| 

We will build on the methodology of ICharlot et alj (2002, 
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Figure 2. The distribution of signal-to-noise for the six lines that we include in our fits. The thick solid line shows the overall distribution and the distributions 
for the different sub categories are shown with different line styles as indicated in the legend. The objects with S/N in the emission line outside the displayed 
range are included in the extreme three bins of the distributions. 
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hereafte r C02) and model the em ission lines in our galaxies 
with the lCharlot & Longhettll fcOOll CL01) models. These com- 
bine galaxy evolution models from iBruzual A. & Chariot! \ 19931 
version BC02) with emission line modelling from Cloudy (Fer- 
land 2001; see CL01 for details). These are similar in spirit to 
other models recently presented in the litera t ure {K ewley et al 

[ 2001 at IZackrisson et all l200lt iBressan et ail l2002t iMov et al 
2001 ; Panu zzo et alj20 03). but they contain a physically motivated 
dust model and are optimised for integrated spectra as opposed to 
individual HII regions. The modelling is discussed in more detail in 
ICharlot et alJ 120031 hereafter Paper I, see also C02); here we will 
only discuss those aspects important for our current undertaking. 

We have generated a model grid using the following effective 
(ie. galaxy- wide) parameters: metallicity, Z, the ionization parame- 
ter at the edge of the Stromgren sphere, U, the total dust attenuation 
in the l/-band, ry, and the dust-to-metal ratio of the ionized gas, f . 
The parameter ranges and grid sizes adopted are given in Table [2] 
Since we will only study relative line fluxes in this paper, we are 
not sensitive to stellar age, star formation history and the relative 
attenuation by dust in the stellar birthclouds (i.e. giant molecular 
clouds) and the ambient ISM. In total our grid contains ~2x 10 5 
models. 

Given the major uncertai nty in tra ditional star formati on esti- 
mates due to dust attenuation iSullivan etal j200 it iHopkins et alJ 
1200 it lBell&Kennicuttll200ll : ICalzettilll997l) , we should briefly 
summarise our dust treatment which follows that of Chariot & Fall 
(2000, hereafter CF00). The CF00 model provides a consistent 
model for UV to far-IR emission. It does this by incorporating a 
multi-component dust model where the birth clouds of young stars 
have a finite lifetime. This ensures a consistent picture for the at- 
tenuation of continuum and line emission photons. In addition the 
model deals with angle-averaged, effective quantities, which de- 
pend on the combined optical properties and spatial distribution of 
the dust. 

Each model in our grid has been calculated with a particu- 
lar dust attenuation and these attenuated line ratios are compared 
directly to the observed spectrum. This means that all lines con- 
tribute to the constraint on the dust attenuation. To first approxi- 
mation, however, our dust corrections are based on the Hq/H/3 ra- 
tio. Note that we do not interpret the derived dust attenuations as 
coming from a simple foreground screen, a procedure which has 
been rightfully criticised as simplistic (e.g. IWitt & Gordorll2000 ; 
iBell & KennicuufeOOltrSerieant et alj2002t) . 

This model grid is somewhat larger than that used by C02, 
but the main difference from C02 is that we adopt a Bayesian ap- 
proach to calculate the likelihood of e ach model given the data . 
This is similar to the approach used by K auffmann et alJ |2003a) 
in their calculation of stellar masses. If, for notational simplicity, 
we enumerate the models with a single index, j, we can write the 
likelihood of a model given the data, P(Mj \ {Li}), as 

^lincs 2 

i=0 

where the sum is over all lines observed for that galaxy, Li is the 
luminosity of line i, A is an overall scaling factor and M is the 
model grid — in practice this is a 4D array. The prior adopted is 
given in Tableland is essentially a maximum ignorance, flat prior 
for all parameters. The choice of prior has very little effect for the 
high S/N galaxies as long as we span the range of physical parame- 
ters in the sample. However, as we will explain below, it can make 
a difference for low S/N galaxies. 



We fit in linear flux units rather than to flux ratios, because the 
approximation of Gaussianity is more appropriate there. (By com- 
paring the observed ratio of the [Om] lines at 4959A and 5007A 
with the theoretical value 4 , we find that the errors on our flux mea- 
surements become non-Gaussian below S/N ~ 2. Since we limit 
our SF sample to S/N> 3, this is not a major concern) 5 . 

The important advantage of our approach is that it generates 
the full likelihood distribution of a given parameter and from this 
we can rigorously define confidence intervals for our estimates. 
For high S/N objects, these likelihood distributions are symmetric 
around a best-fit value. In this situation it is possible to summarise 
the distribution using the best-fit value and the spread around this 
value. However, for lower S/N galaxies, the distributions may be 
double-peaked or non-symmetric and it is necessary to keep the 
entire likelihood distribution in further calculations. The main dis- 
advantage of working with full distributions is that they are some- 
what cumbersome to manipulate. The basic theory is covered in 
introductory statistics books (e.g. Rice 1995), but for the benefit 
of the reader we have collected the results most important for the 
present paper in Appendix 1X1 

Figure |4] shows the result of taking a high S/N spectrum and 
decreasing the S/N of the emission lines. Each likelihood distribu- 
tion is the result of 100 realisations (assuming Gaussian noise on 
the line flux). The figure shows S/N= 2, 5 and 15, where the S/N is 
that of the weakest line of those contributing to the BPT diagram, 
for this galaxy this is H/3. 

The distributions broaden as expected when the S/N de- 
creases. There is also a slight shift in the estimates with decreasing 
S/N — in particular the estimate of the SFR C goes up by 0. 1 dex in 
the mode, somewhat more in average. Although the magnitude and 
sign of this shift depends on the particulars of the line fluxes of the 
galaxy, it is an indication that our apparatus does not function well 
at low S/N. For this reason, and because we cannot reliably classify 
galaxies at low S/N, we only use the results of the model fits for the 
SF class. 

Figure [5] shows the distribution of the difference between the 
observed value of the line fluxes and that of the the best-fit model, 
divided by the error for all galaxies in the SF class. The agreement 
is excellent except perhaps the models slightly overpredict the H/3- 
flux for the highest S/N objects. This could be caused by uncertain- 
ties in the continuum subtraction, but it has negligible effect on our 
SFR C estimates. 

This modelling approach is necessary to get accurate SFR C 
estimates because the SDSS spans a range in galaxy properties 
such as Hubble type, mass and emission-line characteristics. Pre- 
vious studies of star formation activity (see Kennicutt 1998, for a 
review) using optical spectra have usually assumed a fixed conver- 
sion factor between Ha luminosity and SFR C . Dust attenuation is 
estimated from Ha/K/3 assuming a fixed unattenuated Case B ra- 
tio. The fixed conversion factor appears to be a reasonable approx- 
imation when stellar absorption in the Balmer lines is taken into 
account (e.g. lCharlot et alj2 002V However, as is well known, and 
shown quantitatively by CL01, it is not correct in detail. The ap- 
proximation neglects the effect of metallicity and ionization state. 



4 The likelihood distribution of the ratio of two Gaussians is, of course, not 
Gaussian and we compare with the appropriate distribution for the ratio of 
two Gaussians. 

5 Few galaxies have S/N < 3 in [O II]3727 or [S II]67 16 when the S/N > 3 
in the other lines (1-2% for [O II]3727 and ~ 4% for [S II]6716). These do 
not affect our fits as the other lines dominate the fits for these objects. 
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Parameter 


Description 


Range 


Z 


The metallicity 


-1 < log Z/Z Q) < 0.6 in 24 steps 


u 


The ionisation parameter 


-4.0 < log U < -2.0 in 33 steps 


TV 


The total dust attenuation 


0.01 < t v < 4.0 in 24 steps 


5 


The dust-to-metal ratio 


0.1 < 5 < 0.5 in 9 steps 



Table 2. The model grid calculated for the present work. This is calculated for a constant star formation history at t = 10 yrs — see text for details. 




Figure 4. The likelihood distributions derived for model parameters and the SFR C as a function of the S/N of the weakest line (here H/3) for one high S/N 
galaxy. Each distribution is the average of 100 realisations with that S/N. The top left shows the likelihood distribution of log SFR C the top right the logarithm 
of the gas metallicity in units of solar. The bottom left panel shows the dust attenuation in the emission lines at V and finally the bottom right shows the 
logarithm of the ionization parameter at the edge of the Stromgren sphere. The galaxy is SDSS J234342. 75-001255. 9 at z = 0.026. 



It also neglects the diffuse emission in galaxies. More careful mod- 
elling is required in order to avoid creating biases in the SFR C es- 
timates as function of metallicity or stellar mass. 

However, in many situations, in particular at high redshift, de- 
tailed modelling in the way we have done is very hard to carry out. 
It is therefore of considerable interest to look at the relationship 
between the Ha-luminosity and SFR e found in our study. 

3.1 The relationship between Ha-luminosity and SFR e 

We will follow CL01 and denote the ratio of observed Ha- 
luminosity to SFRe as, r)Ha'- 

VH a = L HQ /SFR e . (2) 



In this form the Ha-luminosity is assumed to be uncorrected for 
dust attenuation. We now wish to study how this parameter varies 
with the physical properties of the galaxy. It will prove convenient 
to split this into the conversion factor between unattenuated Ha- 
luminosity and SFR C which we will refer to as r] Ha , and the dust 
attenuation at Ha in magnitudes, Ah a, which we will take as 

A Ha ~0.96r v , (3) 

This assumes an attenuation law of the form r(A) oc A~ 0,7 , which 
is a good approximation to the CF00 model over the wavelength 
range we consider. 

Figure |5| shows the likelihood distributions for Ah a in five 
bins in stellar mass. These have been constructed by co-adding 
the individual likelihood distributions for Ahc for all SF galaxies 
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Figure 5. The distribution of the difference between the observed value of the line fluxes and that of the the best-fit model, divided by the error for all galaxies 
in the SF class. The smooth dashed curve shows a unit Gaussian for reference. The horizontally shaded histogram shows the results for the galaxies that have 
S/N in the line between 3 and 10. The obliquely shaded histogram is for galaxies with S/N> 20 in the line. 
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Figure 6. The likelihood distributions of the dust attenuation as a function 
of stellar mass. These are derived from the model fits to the SF class. 



Figure 7. The likelihood distribution of the conversion factor between Ha- 
luminosity and SFR C , ffjj a - These were again derived from the model fits 
to the SF class. 



in the relevant mass range. We note that ignoring the metallicity- 
dependence of the Case B Ha/H/3 ratio would lead to an overesti- 
mate of the dust attenuation by up to ~ 0.5 magnitudes for the most 
metal rich galaxies. The most striking result from Figure |6] is the 
clear i ncrease in dust content at high stellar masses. Tremo nti et alJ 
(2003) has shown that there is a strong correlation between metal- 
licity and mass for star-forming galaxies. One would therefore ex- 
pect that the main physical reason for the correlation between dust 
attenuation and stellar mass is an increase in the metallicity of the 
emitting gas. It is also interesting that the width of the Ahc* dis- 
tribution increases strongly as one goes to more massive galaxies. 
Some of this is due to the more massive galaxies having lower S/N 
emission lines and hence more extended likelihood distributions. 
However, note that we require S/N> 3 for all four lines in the BPT 
diagram, so it is unlikely that this is the main cause. 

Figure Q shows the corresponding likelihood distributions for 
rf Ha . We have compared this to the conversion factor advocated 



bv lKennicutlll 19981) (nl,„ (Kennicutt) = lO 4128 erg/s/M /yr for 
our adopted IMF). Thi s value is in dicated as the vertical dotted line 
in Figure^ Clearly the Kennicutt 1 1998) conversion factor is a very 
good typical value and compares well with the median value for our 
sample log TJj^a — 41.27. However, the peak values of log 
do vary by nearly 0.4 dex, with the most massive/most metal rich 
galaxies producing less Ha-luminosity for the same SFR C than 
low mass/metal poor galaxies. This correlation is also likely to be 
driven in main by the changes in gas-phase metal licity with mass, 
but as discussed by CL01 and Charlot ~t alJ 12002T) a changing es- 
cape fraction for ionizing photons is also likely to contribute to 
these changes in rfij a . 
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3.2 A comparison to other estimators 

These trends with mass suggest that any simple recipes for con- 
version of Ha-luminosity into SFR C are likely to show some bi- 
ases with stellar mass. To quantify these possible biases, it is 
useful to compare our SFR C estimates with those obtained from 
previous methods to convert observed Ha- flux to SFRs (se e also 
ICharlot et all2002l) . We noted above that fhe lKennicuttl 1 19981) con- 
version factor is a good typical value so we will use this in what 
follows, ie. 

SFR C =L° Ha /10 41 - 28 , (4) 

where L° Ha is the estimated emission luminosity of Ha. Follow- 
ing the discussion above we will also fix the Case B Ha/H/3 ratio 
to be 2.86 (appropriate for an electron density of n = 100cm -3 
and electron temperature T e — 10 4 K; Osterbroc k 1989 ), where 
relevant below. We have also adopted the ISeatonl ll979l) attenua- 
tion law in the following. These choices have been made to ensure 
that we will use a method to estimate SFRs which is very s imilar 
to those commonly used in the literature (e.g. lSullivan et all2000h . 
The residual difference between methods therefore lie in the esti- 
mates of dust attenuation and stellar absorption under the Balmer 
lines. This leads to the following list of methods of SFR C estima- 
tion: 

(1) Assume that the observed flux can be converted to a lumi- 
nosity without any correction for dust or continuum absorption. 
This underestimates the SFR C , but might be reasonable for strongly 
star-forming low metallicity galaxies. 

(2) Correct for dust attenuation by assuming a default attenua- 
tion of Ay = lmag in the emission line and correct for absorption 
at Ha by assuming a typical stellar absorption, EW„(, s (Ha), of 2A. 
If the assumptions are correct in the mean for the sample this is 
likely to give a good estimate of the SFR c 's, but the scatter around 
the mean value might be large depending on the sample. 

(3) Improve on the dust corrections by using Ha/H/3 (or some 
other Balmer line combination) to determine the dust attenuation 
using a fixed dust-free Case B reference value. We can make two 
assumptions about the absorption underlying H/3: 

(a) We can assume that EW a i, s (Ha)=EW a ( JS (H/3), this is the 
most common assumption in the literature. 

(b) EW abs (Ua)= 0.6EW a6s (H/3). This is a better assump- 
tion for the BC03 models and presumably for our galaxies as 
well. 

(4) Finally, with good spectra such as the SDSS it is possible to 
perform a careful estimate of the absorption at Ha and H/3, which is 
what we have done for the present paper. These fluxes can be used 
to determine the dust attenuation as above and thence the SFR C . 

We compare these four estimates of SFR c 's with our best es- 
timates in Figure [8] To produce this figure we took all SF galax- 
ies with S/N in H/3> 10 (to ensure that dust attenuations from 
Ha/H/3 are reliable) and compared each of the methods 1-4 above 
to the mode of our likelihood distribution for the SFR c 's. The fig- 
ure shows the median of the logarithm of the ratio of the two SFR C 
estimates in bins of 0.2 dex in stellar mass. 

It is immediately clear that methods 1, 3b and 4 all agree well 
with our estimates for low mass galaxies. This is expected because 
the equivalent width of stellar absorption is very small compared 
to that of the emission line (the median ratio is 2-6% at log M* < 
10) and the dust attenuation is very low in these low metallicity 
galaxies. But it is clear that the assumption (method 3a) of equal 



absorption under Ha and H/3 leads to an overestimate of the dust 
attenuation and hence an overestimate of the SFR C . Method 2 does 
not perform well at low masses either, but this is again expected 
since an average dust attenuation of 1 magnitude at V is too high 
for these galaxies so again we overestimate the SFR C . 

As expected method 1 starts to diverge from the other esti- 
mators when going to more massive and dusty systems (cf. Fig- 
ure|6|. Likewise method 3a gives systematically discrepant results 
as explain above. Method 2 is similar to method 1 but agrees with 
the more sophisticated estimators in a different mass range since 
the underlying assumptions are more appropriate for more massive 
systems. 

Finally the remaining methods, 3b and 4, agree fairly well with 
the results of the full fits except at the highest masses. This is be- 
cause the assumption of a fixed i]^ a and Case B ratio for all masses 
is flawed as we noted in the discussion of Figure However it 
is interesting to note that the changes in the Case B ratio and in 
VHa w ' m mass nearly cancel when one estimates the SFR C over a 
large range in mass. This happy coincidence comes about because 
the standard Case B ratio will tend to overestimate the unattenu- 
ated Ha-flux, whereas ignoring the variation on rflj a will require a 
larger Ha-flux for the same SFR C . Thus it is a fairly good approx- 
imation to use a fixed Case B ratio and rjjja value, but one should 
never use only one and then apply the relations derived here. Also, 
it is necessary to include the uncertainty in the Case B ratio when 
quoting uncertainties, this uncertainty can only be accurately as- 
sessed using the full model fits. 

At the very highest masses it is however worth noting that 
ignoring these factors might lead to up to a factor of 3 systematic 
underestimate of the SFR c 's. It is therefore important to adjust the 
conversion factor and Case B ratio to that expected for the sample 
galaxies in which case the simple methods will work well. 

We should also note that similar trends are seen with the emis- 
sion equivalent width of Ha, with the result that method 2, 3b and 
4 all give SFR C estimates in good agreement with the full model 
fits when the Ha emision line has an equivalent width > 20A. 

As remarked above, there are a large number of galaxies with 
significant Ha which cannot be classified in the BPT diagram, the 
low S/N SF class. We would nevertheless still like to use the Ha 
line to constrain the SFR C of these galaxies. We do this by adopt- 
ing the average likelihood distribution for the set of high S/N SF 
galaxies with masses that lie within a factor of 3 of the mass of the 
object in question. The range is increased if it contains less than 50 
galaxies. 



3.3 Additional uncertainties 

There are several residual uncertainties in our determination of 
SFR C t hat warra nt discussion. First, we have considered only one 
IMF, the lKroupali200ll) universal IMF. The emission lines we con- 
sider here depend on the massive end of the IMF. As long as this 
does not vary dramatically, the effect of changing the IMF is mostly 
to change the overall normalisation o f our S FRc's. To correct from 
our choice of IMF to the ISalpeteil ll955l) IMF between 0.1 and 
IOOA/q, one should multiply our SFR C estimates by 1.5. This is 
the ratio of mass in the two IMFs for the same amount of ionising 
radiation. 

Second, our model assumes that essentially no ionising radi- 
ation escapes the galaxies (see CL01). This assumption is neces- 
sary in order to relate Ha-luminosity to SFR C and seems reason- 
able for our sample of relatively massive galaxies. Even local star- 
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Figure 8. Our estimates compared with the simpler SFR estimates discussed in the text as a function of stellar mass. We have used all SF galaxies with S/N 
H/3 > 10. The figure shows the logarithm of the ratio of our estimate of the SFR C to that obtained by the alternative method, so a value of zero corresponds 
to equality. See text for details. 



burst galaxies have e scape fractions < 10% iLeitherer et alll995l 
Heck man et all200ll see also CL01). 

Finally, our model uses angle-averaged, "effective" parameter 
values, whereas any particular galaxy is only seen from one angle. 
For a given galaxy, there may be a mismatch between the modelling 
and the observations. We have verified that no significant system- 
atic is present with respect to the inclination of the galaxies, but 
a residual scatter is likely to remain. This can not be quantified 
without rigorous modelling of the spatially resolved properties of 
galaxies. 

As explained in Paper I, additional uncertainties come from 
the mix of stellar populations within the fibre, from uncertainties 
related to interpolation of our model grid (1-2 %), and from uncer- 
tainties in stellar tracks and theoretical population modelling (~ 2 
%). We include these uncertainties below by adding a 4 % uncer- 
tainty in quadrature. 

The largest source of uncertainty in the estimates of SFR c 's 
within in the fibre, however, is that due to the possible contamina- 
tion of our spectra by other sources of ionising radiation. We now 
turn to a discussion of these. 



4 OTHER SOURCES OF EMISSION LINE FLUX 

As with any other non-resolved spectroscopic survey of galaxies, 
the emission line fluxes of the galaxies in our sample will contain 
a component due to sources not directly connected to star forma- 
tion. These include planetary nebulae (PNe), supernova remnants 
(SNRs), the diffuse ionized gas (DIG) and active galactic nuclei 
(AGN). The contributions of SNR, PNe and DIG are discussed in 
Appendix iBl where we show that they are of minor importance for 
our work. Here we will quantify the possible level of AGN contri- 
bution in each of the classes defined in section |2~T1 We will show 
that for the Composite and AGN classes, blind application of the 
modelling machinery described in the previous section is likely 
to lead to incorrect results. Note that a comprehens ive study of 
the AG N activity in our sample is presented in Ka uffmann et alJ 
l2003d) . 

There are at least three methods commonly employed to deal 
with the presence of AGN in a galaxy survey: 

(i) Remove galaxies with AGNs by cross-correlati ng the sam- 
ple with published AGN catalogues (e.g. ICondon et all l2002t 
ISerieant et all2002h . 

(ii) Identify galaxies with non-stellar ionising spectra using a di- 
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agnostic diagram, typic ally the so-called BPT diagram introduced 
bv lBaldwinetalll 19811) . This is possible so long as one can detect 
the lines required for classification (see section |2~TV At z > 0.5, 
where Ha is redshifted out of the optical th is method is less accu- 
rate (see however iRola et alll997l : lTresse et alll99rj) . 

(iii) Finally AGN can be subtracted in a statistical manner. 
This is necessary whe n no other methods are applicable (e.g. 
iTresse &M addox 1998). The method is, of course, sensitive to evo- 
lution in AGN activity with redshift and it is therefore necessary to 
include the related uncertainties in the error budget for the derived 
SFRs (e.g. lTresse et alfcOOllFlores et alll999h . 

All of these methods tend to classify galaxi es as ei ther AGN or 
starbursts, but as shown by Kauffman n et alJ 12003d) . many AGN 
have young stellar populations and ongoing star formation. 

Our analysis will be based on the BPT diagram. The left panel 
of Figure|5|shows the BPT diagram as a 2D histogram for galaxies 
in our sample with S/N > 3 in all lines. The shading is based on the 
square root of the number of galaxies in each bin and the grid spac- 
ing is 0.05 along the x and y-axes. The lines delineate our AGN, 
C and SF classes. The cross shows the location of an "average" 
AGN which is constructed by taking the mean of the luminosities 
of all AGN galaxies with sigma-clipping of outliers using a — 3.5. 
We use this AGN below, the results are robust to the exact average 
AG N used. 

iKewlev et alJ |2001b) advocate the use of several diagnos- 
tic diagrams to rigorously isolate AGNs. However, diagnostic di- 
agrams involving [S Il]6716/Ha or [O I]6300/Ha are much less 
effective at assessing the degree to which AGN may contaminate 
our estimated SFR C . This is because a change in AGN fraction 
moves a galaxy almost parallel to the SF locus in these diagrams. 
In the BPT diagram of Figure|5| a change in AGN fraction moves 
the galaxy perpendicular to this locus. 

The upper line in Figure [5] shows the uppermost region that 
can in principle be reached by systems ionized by normal stellar 
populations. However, real star forming galaxies (and ionization- 
bounded Hll-regions) populate a narrow sequence in the BPT dia- 
gram, because there are strong correlations between ionization pa- 
rameter, dust-depletion and metallicity. A galaxy with both an AGN 
and ong oing star formation will be located to the right of this line 
(see also lKewlev et alj2001bT) . 

The right-hand panel in Figure [5] shows the BPT diagram 
for the SF class, but this time as a conditional density diagram 
(i.e. the distribution in each bin along the x-axis has been nor- 
malised to unity). This distribution i s very similar to that found 
for Hll-regions in nea rby galaxies (e.g. lBresolin & Kennicuttl2002t 
iKennicutt et aljEoOfj) . The dashed lines show fits to the intervals 
enclosing 95% of the distribution. The fits use only the data with 
log[A f Il]6584/_ffa < —0.5. This ensures that the fits are the same 
if the AGN and C classes are included in the distribution. 

We will take the upper dashed line in the right hand panel 
as the upper limit for our pure star formation sequence. For each 
galaxy above this limit, we ask what AGN contribution needs to be 
subtracted to move the galaxy below the limit. The average AGN 
we use is indicated with a cross in the left-hand panel in Figure [5] 
The results are fairly robust to the exact location of this average 
AGN until one gets close to the AGN region. 

In Figure [Tol we show the results of this exercise for the com- 
posite class. The solid histogram shows the fraction of galaxies that 
have a given fraction of their Hq-Aux coming from an AGN. The 
dashed histogram shows the same for [O III]5007. At the top of the 
AGN plume in Figure^ all the [O IIl]5007-flux can be expected to 
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Figure 9. Left: The standard BPT excitation diagram for those galaxies 
in our sample with S/N> 3 in all lines used here (70189 in total). The 
grayscale shows the square root of the number of galaxies in each bin of 
0.05 in the x and y-axes. The cross shows the location of the average AGN 
used to assess the AGN contribution to the Ha flux. The lines are the same 
as those in Figure □ Right: The same plot after removal of the composites 
and AGN (leaving 42262 galaxies; cf. Table and after normalising the 
distribution in each bin along the x-axis to 1 . The thick dashed lines show 
fits to intervals containing 95% of the galaxies as described in the text. 
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Fraction of line in AGN 

Figure 10. The fraction of [O III]5007 and Ha luminosity estimated to 
come from the AGN in the composite galaxies (see text for details). The 
thick lines refer to Ha and the thin to [O III]50O7. 

come from an AGN, but we do not expect more than ~ 40% of the 
Ha-flux to have a non-stellar origin for any galaxy 6 . 

In summary, we find that 11% of the (observed) Ha- 
luminosity density in the composites is likely to come from an 
AGN. For [O III]5007 AGNs contribute 41% of the luminosity den- 
sity in the composites. This means that using the Ha-luminosity for 
the Composite class without any correction for AGN activity will 
only mildly overestimate the SFR C . 

Because different lines are affected by AGN in different ways, 
model fitting may give unreliable results. We therefore estimate the 
in-fibre star formation in the Composite and AGN classes using 
their measured D4000 values, as discussed below. 

We should also comment that the pure star forming sequence 
differs slightly from the definition of the SF class, but we have 
checked that no more than 0.2% of the Ha-luminosity density in 

6 This number depends somewhat on the average AGN spectrum adopted. 
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Figure 11. The observed relationship between D4000 and specific SFR C 
(both inside the fibre). The contour lines are equally spaced from 10 galax- 
ies per bin to the maximum. The bin size is 0.01 X 0.1 in the units given 
in the plot. The red line shows the average at a given D4000 and the blue 
line shows the mode of the distribution. The dashed lines show the limits 
containing 95% of the galaxies at a given D4000. 



the SF galaxies is expected to be of AGN origin, which is well 
below our systematic uncertainties and we will ignore this hence- 
forth. The AGN contribution to the [O IIl]5007-luminosity of the 
SF galaxies can also be shown to be well below the level necessary 
to bias our model fits. 



4.1 Estimating the SFR/M, from D4000 

For the Composite galaxies and AGN in the sample, we cannot 
use the model fits to determine the SFR c 's, because the line fluxes 
are likely to be affected by the AGN component. In principle it is 
possible to subtract off an AGN component, but in practice this is 
a very uncertain procedure. We have therefore decided to use the 
measured D4000 value to estimate the SFRs and will denote this 
SFR d . 

This method uses the relationship between SFR C /Af, and 
D4000 shown in Figure 1111 to estimate the specific SFRd of a 
galaxy, and from there its total SFRd. This relation shown in the 
figure has been constructed using the derived SFR C likelihood dis- 
tributions and the measured D4000 values (we assume Gaussian 
errors on the latter). The contours show the sum of the PDF in each 
bin. The red line shows the average of the distribution at a given 
D4000 and the blue line the mode. The dashed lines show the 95% 
limits at a given D4000. To derive the likelihood distribution of the 
SFRd of a given galaxy, we convolve the likelihood distribution in 
Figure^^with the likelihood distribution of D4000 for that galaxy. 

To minimise the possibility of biases we did not make a S/N 
cut when constructing Figure fTTI but we did throw out all AGN, C 
and low S/N AGN. To test the reliability of this procedure, we have 
checked that if we co-add the spectra of the low S/N galaxies in the 
SF region of the BPT diagram and run these spectra through our 
pipeline, we retrieve the same SFR c 's (within a couple of percent) 
that are obtained if we co-add the SFRd's derived for individual 
galaxies using the D4000 calibration outlined here. 

In what follows we will use SFRd for the AGN, C and Un- 
classifiable classes and SFR C for the SF and low S/N SF classes as 
our best estimate of the fibre SFR, and we will refer to this as the 
SFR. 



5 APERTURE EFFECTS 

The SDSS is a fiber based survey. At the median redshift of the 
survey, the spectra only sample ~ 1/3 of the total galaxy light. 
The presence of radial gradients in galaxy properties can there- 
fore lead to substantial uncertainties when one wants to correct 
to total quantities. The implications of these b iases have been 
discussed in some detail by several authors (e.g. I Kochanek etal 
| 200lj: | G6mez et alj2003llBaldrv et alj2002t|Perez-Gonzalez et al 
hoojjNakamura et alj2003l) . 

We now discuss our method for aperture correction which uses 
the resolved colour information available for each galaxy in the 
SDSS. We will show that by using this empirically based aperture 
correction we can remove the aperture bias. 

Our aperture correction scheme focuses on the likelihood dis- 
tribution P(SFR/Li|colour), i.e. the likelihood of the specific star 
formation rate for a given set of colours. It is essential to keep the 
entire likelihood distribution in order to avoid biases in our esti- 
mates, as the measured distributions of SFR/Li for certain colours 
turn out to be multi-peaked. We choose to normalise to the 0,1 i- 
band luminosity. This is the reddest band with uniformly small 
photometric uncertainties. We have verified that the results do not 
depend significantly on the choice of photometric band. 

In theory we could construct likelihoods for SFR/Li using all 
the colour information available. This has the disadvantage that 
even with our large sample, many bins have very few galaxies. 
The method is then sensitive to outliers. It turns out, however, that 
01 (g — r) and 01 (r — i) contain all the useful information neces- 
sary for our purposes. Adding other colours does not significantly 
improve our constraints, this is also true if we use the model mag- 
nitudes which give somewhat higher S/N in the u and z magnitudes 
iBaldrv et all2003l) . 

We therefore construct P(SFR/L; |colour) on a grid with bins 
of size 0.05 in 0A {g-r) and 0.025 in 01 (r-i). We add together the 
likelihood distributions for SFR/Li (inside the fibre) for all galax- 
ies in each bin (we exclude AGN and low S/N AGN, including them 
with their SFRs estimated from D4000 does not change the results). 
The result of this procedure is illustrated in Figure fT2l This shows 
P(SFR/L l |colour) for a subset of the full 01 (g - r), 01 (r - i) 
grid. The colour is indicated as 01 (g — r)/ 0,1 (r — i) in the top left 
hand corner of each panel and increases to the right for °' 1 (g — r) 
and upwards for (r — i). The number of galaxies in each bin is 
also indicated in each panel as well as the number of galaxies that 
have colours in this bin outside the fibre (see below). The thick and 
thin bars on the x-axes show the average and median SFR/Li for 
that bin respectively. Because of the non-symmetric nature of these 
distributions, the average differs noticeably from the median. 

The shape of the distributions varies strongly with colour and 
it is often not well represented by a Gaussian. In addition, higher 
values of SFR/Li often occur at redder 01 (r — i) colours. This 
is because the emission lines around Ha contribute significantly to 
the 01 i-flux during an episode of star formation. The likelihood 
distribution also becomes very wide for galaxies with 0,1 (g — r) > 
0.7, °' 1 (r — i) < 0.5 and their star formation rates are constrained 
to no better than a factor of 10. This poor constraint on SFR/Li 
is simply due to the degeneracy between age, metallicity and dust. 
This region of colour space is populated both by dusty star-forming 
galaxies and by galaxies with old stellar populations. 

We might consider improving our estimates by including the 
estimated gas-phase metallicity as a constraint. We cannot imple- 
ment this procedure for the full sample, however, because gas- 
phase metallicities can only be determined for the SF class. Includ- 
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Figure 12. The likelihood distribution of log SFR/L^ in bins of 01 (g — r) and 01 (r — i).. The 0,1 (<? — r) colour increases to the right and is indicated as the 
first number in the top left corner of each panel. The second number gives the 01 (r — i) colour and this increases upwards in the figure. The thick black mark 
on the x-axis shows the average SFR/L^ whereas the thin mark shows the median. Note that as one moves towards redder colours the distributions become 
highly non-symmetrical and there is a substantial difference between the average and the median. The number of objects contributing to the distribution shown 
in a given panel is indicated in each panel as N = X. Note also that this is a subset of the full 41 X 45 grid. 



ing the stellar mass as an additional constraint gives little apparent 
improvement. 

We have also verified that k-correction uncertainties do not 
appreciably affect our aperture correction estimates. We did this 
by calculating P(SFR/Li|colour) from galaxies with appropriate 
colours in a bin of width Az = 0.01 in redshift around the galaxy 
in question. This method uses only observed colours but agrees 
very well with the procedure using k-corrected colours. The dis- 
advantage of the model is that for a subset of the data there are not 
enough galaxies with the appropriate colours to construct reliable 
likelihood distributions. We have therefore settled for the simplest 
solution; we use only 01 (g — r), 01 (r — i) to constrain SFR/Li. 

We calculate the colour of the galaxy outside the fibre by sub- 
tracting off the fiber magnitudes from the cmodel (ie. total) mag- 
nitudes and we convolve this with P(SFR/ii|colour) to get an es- 
timate of SFR/Li outside the fibre. We require that at least 5 galax- 
ies contribute to the estimate of P(SFR/Li |colour) in a given bin. 
For the bluest colours, there are bins where no empirical estimate of 
P(SFR/Li|colour) exists, as few galaxies in the survey have very 
blue fibre colours. For these bins we use the closest bin. We have 



tested that this procedure does not introduce any significant bias 
into our estimates. For a few galaxies (176) this procedure failed 
because of photometric problems, these have been excluded from 
the further analysis. No noticeable differences would be found if 
they were included with an average aperture correction for their 
mass. 

We would like to emphasise that our method of aperture cor- 
rection is purely empirical. Our main assumption is that the distri- 
bution of SFR/Li for given 01 (g — r), 01 (r — i) colours is similar 
inside and outside the fibre. This assumption and the reliability of 
this procedure should clearly be tested using resolved spectroscopy. 
We are carrying out such a program and will report on it in an up- 
coming paper. 

For now, the easiest way to test our assumption is to calculate 
P(SFR/Li|colours) in bins of (,z/,z m ax) 2 , where -z max is the high- 
est redshift at which the galaxy in question would pass the sample 
selection criteria. This way we can compare P(SFR/I/i|colours) 
derived from galaxies where the fibre only samples the innermost 
regions with that derived from galaxies where the fibre sample 
a larger fraction of the galaxy. We have done this in 8 bins in 
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Figure 13. The value of the specific SFR, SFR/M, as a function of z/ z max . The solid line with dots show the trend within the fibre, ie. both M* and SFR 
calculated within the fibre and a clear aperture effect is seen. The dashed line shows the result when assuming that all the SFR takes place inside the fibre, ie. 
M* is for the galaxy as a whole and the SFR for the fibre only, and the dash-dotted line shows the result from our aperture corrections. Notice that we very 
nearly take out all aperture biases. 



(jz/z max ) 2 and compared the resulting predicted log SFR/Li for 
the sample galaxies in each of these bins with that derived from 
the total sample. There is a spread, reflecting the intrinsic scatter in 
the method, but for (z/z max ) 2 > 0.4 individual bins gave results 
in agreement with the overall distributions. This indicates that the 
distribution of SFR/Li at a given 0,1 (g — r), 01 (r — i) is fairly 
similar at different radii as long as we sample > 20% of the total r- 
band light (the median fraction of light within the fibre for galaxies 
with (z/z max ) 2 > 0.4). This is the case for majority of the sample. 

Figure 1131 shows that our method removes essentially all 
the aperture bias from the sample. This figure shows the median 
SFR/M* as a function of z/2 max in six different mass bins. All 
galaxies have been included. The specific SFR is estimated from 
the measured value of D4000 for the AGN and C classes (cf. Sec- 
tion The likelihood distributions of SFRAM have been co- 
added at each bin in z/ z max . 

In the absence of aperture effects (and evolution), this plot 
should show no trend with z/z max . Note that evolution is unlikely 
to provide more than 0.1 dex change over the range plotted in these 
diagrams. The dashed lines in Figure [T3l show the trend if one as- 
sumes that all star formation activity takes place within the fibre 



but we use the stellar mass for the whole galaxy. This will clearly 
underestimate the true SFR/ A/*. Figure [T3l shows that this quantity 
decreases towards lower values of z/,z max , as expected. 

The solid line with dots in the diagram shows SFR/M, when 
both quantities are calculated inside the fibre and it is clear that 
there are still strong aperture effects for galaxies with log M, > 
10.5. This is expected since these galaxies often have prominent 
bulges, in which the specific SFR is expected to be low. For lower 
mass galaxies, the aperture problems are considerably smaller and 
it is clear that a simple scaling of the fibre SFR by the r-band flux, 
as done for example bv lHopkins et alj 120031) is an acceptable aper- 
ture correction. This is also the case for galaxies that show no sign 
of star formation activity when more than a third of the total r-band 
light is sampled. It is not correct for other galaxies. 

Finally, the dot-dashed lines show the result after applying our 
aperture corrections. Our correction has removed the aperture bias, 
at least in a statistical way. We note that the spread in SFR/M, at 
a given z/z m3 x is substantial, typically 2 orders of magnitude. We 
have also tested whether the distribution of points is the same at 
all z/-z max after application of our aperture corrections. We find 
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Figure 14. The distribution of the errors on the SFR estimates. The line with 
solid points show the uncertainties derived from the likelihood distributions 
inside the fibre. The line with crosses points shows the same for the like- 
lihood distributions for the total SFR derived as described in the text. We 
have grouped together all galaxies with log SFR/Af, < —12 into one point 
and placed this at log SFR/M, = —13. The x-axis shows log SFR/Af,,, 
measured inside the fibre for the solid points and total for the crosses. The 
points for the total SFR/Af* are offset slightly for clarity. 



SFR measured inside the fibre at high SFR/M* because the aper- 
ture corrections are significantly more uncertain than the SFR esti- 
mates from the spectra. The spread in the uncertainty estimates is 
also increasing for increasing SFR/M* because the outer parts of 
these galaxies are very blue and few galaxies are available for the 
calibration of the aperture corrections at these very blue colours 
(cf. Figure fT2l . The median uncertainties on the total SFR/Af, are 
moderate, however, since the aperture corrections are in average 
well constrained as Figure fT2l shows. 

At low values of SFR/Af, the total and fibre estimates have 
comparable uncertainties 7 . This is because the aperture corrections 
for these galaxies are calibrated by the fibre estimates with similar 
and broad likelihood distributions. The total log SFR therefore gets 
a very similar likelihood distribution. Some galaxies with poorly 
constrained log SFR in the fibre might nevertheless get better con- 
strained log SFR total because they have bluer outsides than insides 
and hence better constrained aperture corrections. 

The best test of our error estimates for the fibre SFRs is re- 
peat observations. Approximately 5000 of our targets have been 
observed at least twice during the SDSS survey. We have used these 
duplicate observations to check that the errors we derive for the 
fiber SFRs are consistent with the dispersion between repeat obser- 
vations. 



that that it is, and that Figure fT3l is an appropriate summary of the 
process even though it only shows the median. 

At this point, it is useful to summarise the SFR indicators that 
are used for each subclass. Inside the fibre we have the following 
situation: 

(i) For the SF class we use the results of the full model fits. 

(ii) For the Low S/N SF class we use the observed Ha- 
luminosity and convert this into a SFR using the r/Ha distribution 
functions derived for the SF class (section lTTl . 

(iii) For the AGN, Composite and Unclassifiable classes we 
use the D4000 value to estimate SFR/Af, and SFR using the 
method outlined in Section l4~T1 

Outside the fibre, we always use the colour method to estimate the 
SFR. 

The uncertainties on our individual galaxy SFR estimates are 
summarised in Figure fT4l To construct this figure we have cal- 
culated the 68% confidence interval for the log SFR estimate for 
each galaxy and calculated the spread in this value in bins of log 
SFR/Af* (where the latter is taken to be the median of the distribu- 
tion, but the results are the same with the mode or the average). All 
galaxies with log SFR/Af, < —12 have been placed in one bin at 
log SFR/M, = -13. 

The solid points show the median 68% confidence interval in 
dex for the estimate of log SFR inside the fibre for each bin in log 
SFR/Af*. In the case of Gaussian errors this would correspond to 
2a. The error bars show the 68% spread in error estimates within 
that bin. The crosses show the same for the log SFR estimated after 
aperture corrections (i.e. total) and are slightly offset for clarity. 

Most of the trends are easy to understand. Firstly the SFR es- 
timates have lowest uncertainty at high SFR/Af, because the S/N 
is high in most lines and tight constraints can be obtained. For 
low values of SFR/Af*, the SFR inside the fibre is estimated from 
the D4000 value. This causes the error estimates to rise because 
D4000 provides a poorer constraint on SFR/Af,. Finally, the error 
decreases when the D4000 is so large that there is no longer any 
room for significant star formation. 

The uncertainties on the total log SFRs are larger than for log 



6 THE LOCAL SFR DENSITY 

We now turn to estimating the total SFR density of the local uni- 
verse. As discussed above, we have a likelihood distribution for the 
SFR for each galaxy. This includes both uncertainties in the model 
fit and the aperture correction. We also need to make a correction 
for evolution in the SFR over the 2.5 Gyr spanned by our redshift 
limits. We do this by assuming that the SFRs of all galaxies evolve 
as: 

SFR(z) oc (1 + zf. (5) 

For the integrated SFR density, the favoured evolution exponent up 
to z ~ 1 is currently /3 ~ 3 jGlazebrook et all2003tlHogg|200il) . 
although substantial uncertainty exists. We will therefore adopt this 
form for individual galaxies as well. We will calculate our results 
at z = 0.1. The differences between /3 — and /3 — 3 are then 
fairly small (~ 6% , see Table 3 below). Note that a single evolu- 
tionary correction for individual galaxies is likely to be wrong. As 
we will see below, there are clear differences in the star formation 
histories of galaxies of different mass, so although the integrated 
star formation density evolves as a power-law, the individual SFRs 
cannot show the same evolution. Fortunately, most of the results in 
the following sections are not dependent on the evolution within 
the sample. 

Our results for the total SFR density are summarised in Ta- 
ble This table gives the mode of the likelihood distribution of 
Pspr as well as the 68% confidence intervals. We show the result 
for P = on the top line and the following lines all assume j3 — 3 
— the relative contributions of the different classes do not depend 
on p. To estimate the star formation density of the survey, we add 
up th e likelihood distributions, using 1 /V ma x as a weight jFeltenl 
1976). As discussed in Appendix^ the summation is carried out 
with a Monte Carlo method. To calculate errors on the final SFR 

7 If we had plotted the uncertainty on the SFR rather than log SFR this 
would of course not be the case. 
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Table 3. The best estimate of the star formation density at 2 = 0.1 from our survey. We show the mode and the 68% confidence intervals as well as the 
percentage contribution to the total for the different classes. The total for j3 = (no evolution) is shown on the first row, the others use /3 = 3. 
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0.507 


0.509 
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1.974 


1.998 


100.00% 


SF 


0.317 


0.320 


0.324 


1.001 


1.018 


1.024 


51.58% 


Low S/N SF 


0.063 


0.064 


0.065 


0.445 


0.448 


0.464 


22.69% 


Composites 


0.078 


0.080 


0.092 


0.203 


0.207 


0.210 


10.48% 


AGN 


0.026 


0.026 


0.028 


0.082 


0.084 


0.086 


4.26% 


Unclassifiable 


0.019 


0.019 


0.020 


0.213 


0.216 


0.228 


10.96% 



density, we carry out 100 bootstrap repetitions. For each repetition 
we draw 149660 objects from our sample with replacement which 
makes up the bootstrap sample. For this sample we carry out 30 
Monte Carlo summations as discussed in Appendix^ — the results 
do not change when these numbers are increased. We then calculate 
the mean, mode, median and confidence intervals from the distri- 
bution of results obtained for each bootstrap repetition and Monte 
Carlo summation. The confidence intervals reflect only random er- 
rors. The systematic uncertainties are substantially larger. 

It is, of course, difficult to estimate accurately the systematic 
uncertainties. The first is the uncertainty in the estimator used to 
calculate the SFR inside the fibre for the Composite, AGN and Un- 
classifiable class. We have experimented with various estimators 
before settling on D4000. They all agree fairly well with a spread 
of 2% which has to be taken as a systematic uncertainty. 

Our SFR density estimate is sensitive to our aperture correc- 
tion scheme. We remarked above that our main assumption is that 
the distribution of log SFR/ Li is the same at a given colour inside 
the fibre as outside. This is not necessarily so, something which 
is hinted at in the clear difference between the mode, median and 
average for the likelihood distributions. These indicate that the like- 
lihood distribution for a given colour might be composed of at least 
two components and the balance between these could be different 
in the outer regions of the galaxy, cf. the double-peaked nature of 
some distributions in Figure fL?! It is therefore possible to take the 
difference between the average (the formally correct estimate to 
use) and the mode as indicating the importance of this assumption. 
We have therefore carried out the aperture corrections also using 
the mode as an estimator of log SFR/Li. In this case we find a 
SFR density 15% lower than that in Table|3] 

In addition there is a 6% difference between /3 = and j3 = 
3 which we take as an additional systematic uncertainty of ±3%. 
Finally differences between the linear interpolation scheme we use 
to draw numbers from the likelihood distribution and higher order 
schemes also produce a scatter of 1-2% percent in the calculation 
of the average. We will therefore adopt 1^% as indicative of the 
expected range of the systematic uncertainties. 

This gives our best estimate for the total SFR density at z — 
0.1 as 



Psfr = 1.915lo.oi( ran d-)_o.42( s y s - 



(6) 



in units of 10 -2 I170 Mo/yr/Mpc 3 . The random errors correspond 
to the 68% confidence interval, while the larger systematic error 
is the expected total range. We have here taken the average of the 
= 3 and /3 = estimates since this difference is included in the 
systematic uncertainties. 

We have calculated the SFR density in bins of absolute 01 r- 
band magnitude and this is plotted as solid circles in Figure [TBI For 




Figure 15. The star formation density as a function of r-band absolute mag- 
nitude compared with the r-band luminosity density scaled to have the same 
peak value. The error bars on the SFR density are derived from bootstrap re- 
sampling. The error-bars on the luminosity density estimate are suppressed 
for clarity. 



comparison, we plot the r-band luminosity density for our sample 
(see iBlanton et ai1l2003bl for a more in-depth discussion) scaled 
to the same peak va lue as the SFR d ensity (crosses). If we fit both 
distributions with a (Schechtet 1976) function of the form 



4>(L)dL = </>, 



°(£l s 



-L/L: 



dL 

17' 



(7) 



we find that the faint-end slopes, ofs, are the same for the two 
functions, but that L» is 0.27 magnitudes fainter for the star forma- 
tion density. The error bars on the SFR density are derived from 
bootstrap resampling of the sample and do not include systematic 
errors. These will to good approximation only affect the overall 
normalisation, and not the shape. 

The curves are very similar in shape. At faint magnitudes 
where the majority of galaxies are strongly star-forming the r-band 
light is a good tracer of the SFR activity, but low luminosity galax- 
ies are somewhat more important for the overall SFR density than 
for the r-band luminosity density. At the bright end of the distribu- 
tion, there is less star formation per unit r-band luminosity. We will 
see later that this is even more prominent when plotted against the 
stellar mass. 

Ver y little of the t otal SFR density occurs outside our sample 
(see also lGlazebrook et alj2003l their Figure 10). When integrat- 
ing up the Schechter function, we find that only 1-2% of the total 
SFR density originates outside our selection limits. We have there- 
fore decided not to correct our values for incompleteness. 
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Figure 16. The star formation density in the present study. The rilled cir- 
cle is our preferred estimate with the systematic and random errors in- 
cluded in the error bar. This is compared with several other recent de- 
termi nations of the local SFR. The papers indicated in the legend are: 
H00:lHaarsma etal]l200cli. C02: ICondon et al|l2002l). P01:|Pascual et alj 
1200 ll). TM98:lTresse & Maddoxl ll998l). G95:lGallego et alJ ll995l) . SOO: 
ISullivan et alj l200d) , Se02:ISerieant et ai]<2002l). S Ne: SFR derived from 
SNe are as reported by Fukugita & Kawasaki 1 2003 ) who are also respon- 
sible for the upper limits from neutrino observations at SuperKamiokande, 
G03: Glazebrook et al. 1 2003) for whom we show the full range of the Ha- 
derived local SFR. The dotted line shows a (1 + z) 3 evolution of psfr for 
comparison. All values have been corrected to the Krouria 1200 lit universal 
IMF and h = 0.7. 



Figure fT6l places our estimate together with other recent es- 
timates from the literature. In order to make this comparis on we 
have adjusted the SFR density reported using the Sal peteJ 1 19551) 
IMF with a lower mass limit of 0.1 Mq and an upper mass limit 
of IOOMq to our IMF by dividing all values by 1.515. All values 
have been corrected to h — 0.7, but no attempt has been made to 
adjust the values for small differences in the assumed cosmological 
model. 

We have indicated different methods of estimating the SFR 
using different colour s in the figure. It is clear th at radio deter- 
minat ions lHaarsma et all200ollCondon et all2002tlSerieant et all 
120021) typically give the highest estimates, although alternative cal- 
ibrations might bring these down somewhat (Hopkins et al. 2003). 

Our value se ems to be in good a greement with other Hq- 
based estimates jGallego et alj I l995t iTresse & Maddoxl Il998t 
iGlazebrook et alJl2003l) . In particular our valu e is in very good 
agreement with that derived for the SDSS by Glazebrook et al. 
(2003) in their analysis of the cosmic spectrum. We show the full 
range of their Ha-based estimate for the local SFR in Figure fT6l 
and it is clear we are in very good agreement despite the different 
approaches taken. It might seem surprising that our error-bar is no 
smaller than many other determinations, which have used samples 
that are two orders of magnitude smaller than ours. This is mostly 
due to the adoption of a rather conservative systematic error for the 
aperture corrections. 

Finally, to close off this section, we comment on the contri- 
butions to the overall psfr from the different classes in Table[3] It 
is noteworthy that despite only contributing about 23% of the total 
stellar mass in the local universe, the SF class is the dominant class 
in terms of SFR density. Over 50% of psfr, comes from this class. 
Likewise it is interesting that up to 15% of the total SFR takes place 



2 F 



1 r 



r 



-3 r 



































-, , , , 1 , , , , V ,1 , , , 1 , , , 





9 10 
Log M,/Mo 



11 



Figure 17. The relationship between the stellar mass and the SFR (both 
inside the fibre) for all galaxies with no AGN contribution. The figure has 
been volume weighted and normalised in bins of stellar mass. The contours 
are therefore showing the conditional likelihood of SFR given a stellar mass. 
The bin size is 0.1 X 0.1 in the units given in the plot. The red line shows 
the average at a given stellar mass, whereas the blue line shows the mode 
of the distribution. The dashed lines show the limits containing 95% of the 
galaxies at a given stellar mass. 



in gala xies that show signs of AGN activity (cf . iKauffmann et alJ 



7 THE OVERALL PROPERTIES OF LOCAL STAR 
FORMING GALAXIES 

We now study the dependence of the SFRs on other physical pa- 
rameters of the galaxies. Most of the results we show below will 
be familiar, as the general trends have been known for a long time. 
The main importance of what follows is that for the first time it is 
possible to derive the full distribution functions for these quantities. 
This adds considerable quantitative information to what was known 
before. One important point is that the results shown in this section 
are only very weakly dependent on the systematic uncertainties dis- 
cussed previously. We will therefore ignore these in what follows. 

We start with Figure 1771 which shows the SFR distribution 
as a function of stellar mass for the SF, low S/N SF and unclas- 
sifiable classes. The figure has been volume weigthed and nor- 
malised in bins of stellar mass so it shows the conditional proba- 
bility of SFR given a stellar mass. The clear correlation between 
SFR and stellar mass over a significant range in log Af* is notice- 
able and will be a recurring theme in what follows. Note that that at 
log M,/Mq > 10, the distribution of SFRs broadens significantly 
and the correlation between stellar mass and SFR breaks down. 

In addition to the correlations between different galaxy pa- 
rameters, it is instructive to study 1-dimensional projections of our 
multi-dimensional distribution functions. This enables us to ad- 
dress the question of how much of the total SFR density takes place 
in different kinds of galaxies. To do this we carry out the same 
Monte Carlo summation that we used to derive the total SFR den- 
sity, but this time in bins of mass, concentration, size, surface den- 
sity etc. The results of this exercise are shown in Figures [T8l and l 1 9| 
These show the star formation density per bin for a set of 8 differ- 
ent physical quantities. The bin size is indicated in each panel. The 
shaded gray area shows the 68% (la) confidence interval deter- 
mined from the Monte Carlo and bootstrap summations. 

The top left panel in Figure fT8l shows psfr as a function of 
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Figure 18. The total star formation density, psfr. as a function of Top left: The central concentration of the galaxies. As discussed in sectionQthis is a proxy 
for a proper morphological classification. Top right: The stellar mass and Bottom left: The stellar surface mass density. The final panel shows psfr as a 
function of the galaxy half-light radius. The shaded region in each panel shows the 68% confidence region. The units are given on the left, and the bin size is 
given in each panel. 



the central concentration of the galaxies, defined as the ratio of the 
radius containing 90% of the r-band lig ht t o that conta ining 50%. 
As discussed by Shimasaku et al. (2001) and Stra teva et alj (2001 ). 
this quantity correlates quite well with morphological type for the 
galaxies in the SDSS. It related to other similar quantitat ive mea- 
sures of galaxy concentration discussed pre viously (e.g. [Morgan 
Il958t iDoi et~allll993l : lAbraham et alJll994 . The plot shows that 
the psfr distribution is broad and that it peaks around the value 
for a pure disk R00/R50 = 2.33. The SFR density contributions 
split into individual galaxy sub-classes are shown by thin lines as 
described in the legend. We see that the SF class contributes pri- 
marily to the disk-like part of the distribution. The star-formation 
in AGN occurs in galaxies that are more bulge-like. 

The top right panel shows psfr as a function of galaxy stellar 
mass. We can observe several interesting features here: 

• A considerable amount of star formation takes place in low 
mass galaxies. The best-fit Schechter function has a faint-end 
slope of —0.45 and a characteristic mass of log M/Mq = 10.95 
with a possible transition to a steeper slope of qfs ~ —0.55 at 
log M/Mq < 9.8. Approximately 50% of the total SFR takes 
place at M. > 2 x 1O 1O M and about 90% at M* > 1O 9 M . 



• The SF class almost completely dominates the SFR budget at 
masses less than about 1O 1O M0. 

• At masses > 1O 1O M0 the majority of SFR takes place in 
galaxies low S/N SF class with important contributions estimated 
from galaxies that either cannot be classified or which show signs 
of AGN in their fibre spectra. 

Many of the same trends can also be seen in the bottom left 
panel w hich shows psfr as a fu nction of the stellar surface mass 
density ( Kauff mann et alj|2ob 3bi). Taken together with the top left 
hand panel, it is clear that the majority of the SFR comes from 
high-surface brightness spirals. These typically have half-light radii 
around 3.0 kpc (slightly smaller than the Milky Way). 

Turning to Figure fl9l we now plot quantities that are deter- 
mined inside the fiber aperture on the a>axis. We caution that since 
we only know these quantities within the fibre aperture, they may 
not be representative of the galaxy as a whole. The top right hand 
panel shows that most of the star formation takes place in galaxies 
with low D4000. 12% of the total SFR density comes from galaxies 
with D4000> 1.8, 2% with D4000> 2.0. Note that the star forma- 
tion in these galaxies comes entirely from their outer parts and they 
are probably spiral systems with significant bulges. The bottom left 



The physical properties of star forming galaxies 19 
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Figure 19. Like the previous figure but this time we show parameters determined inside the fibre along the ic-axis and the distribution functions are therefore 
relative to the SFR density inside the fibre. The top left shows the distribution with respect to the 4()00A break, the gas-phase oxygen abundance is at the top 
right, the present-to-past average star formation rate (bottom left) and finally the dust attenuation at Ho in the bottom right panel. 



panel shows that the majority of star formation is taking place in 
galaxies that have a present-to-past average star formation rate be- 
tween 0.3 and 0.5. 

The top right panel shows the distribution as a function of gas- 
phase oxygen abundance. Since the metallicity is derived from the 
fits to the CL01 models, we only show results for the SF class. It 
is interesting that most stars are forming in galaxies with solar or 
slightly super-solar metallicities, depending on the ov erall calibra- 
tion of the metallicity scale (e.g. lKennicutt et all20 03:). Finally the 
bottom right panel shows the star formation density as a function 
of the attenuation at Ha (again, only for the SF class). This shows 
a peak around 1.0 magnitudes of extinction in Ha, in good agree- 
ment with other determinations (e.g.|Kgnnicutt|l9 83; Sulliv an et alJ 
l200fi iHopkins et aDl2003l: iNakamura et alJl2003l C02). Notice 
that we calculate the SFR weighted distribution function of Ahc, 
whereas most earlier studies have focused on a sample averaged 
value. It is clear from Figure |6| that any such averaging will de- 
pe nd strongly on the sa mple used. This has also been pointed out 
bv lHopkinsetalJ (2003). who show that a radio-selected subsample 
of the SDSS has considerably higher attenuation than the average 
SDSS galaxy. 

We now compare these SFR distributions with number and 



mass density distributions. Number distributions as function of 
many photometric properties and as a function of loca l density 
have been discus sed in detail by Blanto n et alJ l2003al) and by 
iHogg et alH2003l). Likewise the mass distributions have been dis- 
cussed by Kauff mann et alJ l2003albt) . 

Figure|20|is analogous to Figure fT8l and shows the distribution 
of psfr together with pm, , the mass density distribution, and pN, 
the number density distribution. In these plots we have normalised 
the distributions to the total within the plot limits. We remark that 
the spikes seen in several of the number density distributions are 
due to a handful of galaxies that are extremely close to our selection 
limits with very high volume corrections. The bootstrap errors on 
these spikes are large. 

Figure 1201 shows that although high concentration galaxies 
are few in nu mber, they contain the majority of the mass (see the 
discussion by iKauffmann et alj|2003al) . As discussed previously, 
most of the star formation occurs in disk-like systems. The top 
right hand panel shows that the majority of galaxies are small (cf. 
Blanton et al. 2003a) and that the galaxies dominating the star for- 
mation budget and those dominating the mass budget are fairly sim- 
ilar in size. The bottom left hand panel in the figure shows the de- 
pendence on mass. We see that star formation is typically occurring 
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Figure 20. The contribution to the total number, mass and star formation density as a function of various galaxy parameters. The SF density is shown in blue, 
the mass density in red and the number density in black. Top left: The contribution to the different densities as a function of the concentration of the galaxies. 
Top right: The same, but as a function of the half-light radii of the galaxies. Lower left: The density contributions as a function of log stellar mass and Lower 
right: The contributions as a function of log of the stellar surface density in Af0/kpc 2 



in galaxies that are somewhat less massive than those that contain 
the bulk of the stellar mass in the local Universe. As we noted pre- 
viously, most of the star formation is taking place in high surface 
brightness galaxies. However, the the bottom right panel shows that 
the star formation is distributed over a wider range in stellar surface 
density than the stellar mass. This is presumably because a signif- 
icant fraction of the stellar mass is located in ellipticals, which are 
not currently forming stars. 

Figure l2"Tl is analogous to Figure [T9l and shows properties mea- 
sured within the fiber aperture. As discussed previously, the major- 
ity of the star formation is occurring in galaxies with low D4000. 
It is nevertheless intriguing that the majority of galaxies in the uni- 
verse have even lower D4000 than those that dominate the SFR 
density. A very similar result is illustrated in the top right-hand 
panel, which contrasts the contributions to pjv, psfr and pm, from 
galaxies with different present-to-past average star formation rates. 

The bottom panels show the density distributions as a function 
of the attenuation at Ha (left) and as a function of gas phase oxy- 
gen abundance (right). These quantities are only calculated for the 
SF class. We see that the majority of SF galaxies have low metal- 
licity and dust attenuation, two facts that are likely to be closely 



connected. Most of the mass and most of the star formation are 
found in galaxies with a wide distribution of Ah a, centred around 
1.0 magnitudes with a spread of ~ 0.7 magnitud es for th e SFR 
weighted distribution (see also the related study by Hopkins et al. 
2003). For the Ah a distribution weighted by stellar mass, we find 
an average value of ~ 1.3 magnitudes and a similar spread. The 
difference reflects the fact that the most massive galaxies are more 
dusty than the galaxies dominating the SFR budget. 



8 THE SPECIFIC STAR FORMATION RATE 

The total SFR of a galaxy is interesting and for certain physical 
questions it is the relevant quantity. But given the strong correlation 
between SFR and mass (cf. Figure [171 . it is clear that by normal- 
ising the SFR by the stellar mass, one can more easily study the 
relationship between star formation activity and the physical pa- 
rameters of the galaxies. In this section we will turn our attention 
towards the star formation rate per unit mass, the specific star for- 
mation rate. For compactness we will refer to the specific SFR as 
f sfr below. 

The specific star formation rate is closely related to several 
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Figure 21. Like the previous figure, but this time showing the star formation density versus the 4000A break (upper left panel) the present to past-average SFR 
(upper right), the dust attenuation at Ha in magnitude (lower left) and the gas-phase oxygen abundance (lower right). The latter two use only the SF class. 



other important physical quantities. It defines a characteristic time- 
scale of star formation, 



rsFR = r SFH = — . 



(8) 



Since the supernova rate is proportional to the star formation rate, 
the specific star formation rate represents the current input of su- 
pernova energy per star in the galaxy. Likewise, if the stellar mass 
is proportional to the mass of the ISM, i t is re lated to the porosity 
of the ISM in the galaxy JClarke & Oell2002l) . Finally r SFR does 
not depend on cosmology and is insensitive to the IMF as long as 
the high mass slopes are comparable. 

The star formation per unit mass has either explicitly or im- 
plici tly been used in numerous studies of fi eld galaxi es at z < 1 
(e.g. [Brinchmann & Eliisll200ri Icowie et aljll99ti iGuzman et alJ 
1997; Ellis; 1 19971 and references therein) by using equivalent 
widths of Ha or [O II]3727. In the local Universe, it has however 
often been rephrased in terms of the present to past-average star 
formation rate (e.g. Kennicutt 1983; Scalo 1986; Kennicutt et al. 
ll994llBoselli et all200ll;lGavazzi et all2002h . This is a suggestive 
quantity, which immediately gives an indication of the past star for- 
mation history of the galaxy and its relation to present-day activity. 
The main disadvantage of the present-to-past average SFR is that it 



involves more assumptions than the specific star formation rate as 
we will see below. 

The conversion between the two can be succinctly sum- 
marised as 

SFR 



(SFR) 



= r S FRT(l-7?), 



(9) 



where R is the fraction of the total stellar mass initially formed that 
is return to the ISM over the lifetime of the galaxy, and T is the 
time over which the galaxy has formed stars (T = tu{z) — tform, 
where tn(z) is the Hubble time at redshift z and tf orm is the time 
of formation). 

Given a choice of IMF, it is possible to reliably estimate R us- 
ing stellar evolution theory. We use the median R estimated for our 
galaxies which is R ~ 0.5. For T it is customary to take T = tit, 
ie. assume that all galaxies started to form stars at t = 0. We will 
adopt this convention below (with T = tn (0.1)). This assumption 
is sensible when comparing average b values for large samples, but 
might be more questionable for individual galaxies. In general, this 
means that the b values we calculate will be upper limits. 

Throughout this discussion, we will use b and tsfr inter- 
changeably, dependin g on which is more appropriate. As high- 
lighted bv lRocha-Pinto et"al]j200ol) . the b value is a convenient pa- 
rameterisation of "burstiness" and b — 1 provides a convenient di- 
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Table 4. The best estimate of the present-to-past average star formation rate, b v . We have averaged the values form /3 = 3 and j3 = as appropriate for our 
systematic error estimates. 



Sample 




Fibre 






Total 






16% 


Mode 


84% 


16% 


Mode 


84% 


Total 


0.313 


0.314 


0.325 


0.406 


0.408 


0.413 


SF 


0.994 


0.997 


1.011 


0.881 


0.888 


0.898 


Low S/N SF 


0.198 


0.201 


0.202 


0.421 


0.432 


0.437 


Composites 


0.396 


0.406 


0.464 


0.362 


0.365 


0.373 


AGN 


0.210 


0.216 


0.232 


0.229 


0.233 


0.239 


Unclassifiable 


0.030 


0.031 


0.032 


0.131 


0.133 


0.138 



vide between galaxies consistent with a constant or declining SFR 
over the age of the universe and those that have a higher SFR at 
present than they have typically had in the past. 

Before turning to the results for our sample, we note that there 
are at least two reasonable definitions for the average tsfh and one 
must take care to specify which one is discussing. 

The average ?-sfr value of a sample of N galaxies is simply 



/SFR\ _ i 



yp w SFR, 



(10) 



where the weights Wi can be unity or e.g. 1/V max depending on 
the question asked. This is the appropriate quantity to quote when 
talking about the typical galaxy. In practice we calculate this using 
the full likelihood distributions of ?-sfr for each galaxy. We also 
usually choose to quote the median rather than the average as this 
is binning insenstive (cf. discussion in Appendix 1X1. 

r SFR can De ver y different from the tsfr value for a volume 
limited sample: 



V _ PSFR _ J2i WiSFRi 
r SFR. — 



PM, 



(ii) 



which is the appropriate quantity to quote for the the star forma- 
tion history of the universe as a whole. To calculate r|p R using our 
likelihood distributions, we employ the same Monte Carlo summa- 
tion technique used in the calculation of psfr in section [6] It is 
important to include the stellar mass in this Monte Carlo code, be- 
cause, as mentioned in Appendix lAl neglecting to do so will bias 
our results. (To investigate the size of this bias we carried out the 
calculation of r|p R with and without the full likelihood distribu- 
tion of log M* and found that ignoring the uncertainty in log M* 
resulted in an overestimate of tsfr of about 15%.) 

We begin by calculating b v for the sample as a whole. This is 
a straightforward application of the results in the previous section 
and we show this quantity in Table|4| We see that for h = 0.7, the 
present day universe is forming stars at slightly higher than 1/3 of 
its past average rate, depending on the time-span one averages over 
in equation |9j- Once again we have calculated the likelihood dis- 
tribution using the Monte Carlo method discussed in Appendix lAl 
with 100 bootstrap repetitions and 30 Monte Carlo sums for each 
repetition. The resulting errors are also indicated in Table|4| but as 
with the total SFR density in section|6|the random errors are sub- 
stantially smaller than the systematic ones. Following the discus- 
sion in section|6| we adopt the same estimates of systematic errors 
on the total SFRs and find a present-to-past average SFR for the 
local universe of 



b v (z 



0.1) 



0.408Z 



I (rand. 



v +0.029 



(sys.; 



(12) 



with random errors corresponding to the 68% confidence interval. 

It is interesting to compare b v for the different classes we have 
defined. If we look at quantities determined inside the fibre, it fol- 



lows almost by definition that the SF class has the highest b value 
and the unclassifiable category the lowest. If we look at the values 
derived from the total SFR, the SF class still dominates, but it is 
interesting to note that the global b value for this class is the same 
as the central one. For the other classes one typically finds that the 
global b is higher than that inside the fibre. This is, of course, partly 
a sample selection issue: galaxies classed as SF are those that have 
relatively more star formation inside the fibre. It is also interesting 
to note that galaxies showing indications of AGN activity form an 
intermediate category between the Composites and the unclassifi- 
able. This is again logical because AGN would have been classified 
as composites if they had substantially higher SFR. 

We now study how the specific star formation rate varies be- 
tween galaxies of different stellar mass. We split the galaxy sample 
into nine bins in total stellar mass. We sum up the likelihood dis- 
tributions of rsFR, ie. we calculate rg FR without the normalisation 
in Equation 1 1 ()t . We show the resulting distributions of rg FR in 
Figure l22l The histograms are normalised to the total number of 
galaxies within that bin. No volume correction has been applied, 
but the difference to that shown is very small except in the lowest 
mass bins. (Note that we might not be complete in mass in the low- 
est bin.) The contribution to the distributions that is located outside 
the plotted range has been distributed on the two lowest and two 
highest bins respectively. 

The histogram for galaxies with 10.0 < log M* < 10.5 is re- 
peated in each panel for reference. The shaded histograms show the 
distributions of those galaxies clas sified as recent sta rbursts based 
on their D4000 and HS A values bv lKauffmann et alJ <2003al) . The 
thick black line shows the median volume weighted tsfr value 
(r-g FR ), and the red line shows rg FR . The latter is higher as the 
total SFR is dominated by galaxies with extreme values of tsfr- 

Figure l22l shows that there is a reasona bly good agreem ent be- 
tween the recent burst estimates of Kauffmann et al. ( 2003a), which 
are sensitive to activity in the last Gyr, and the activity indicated by 
emission lines. Thus galaxies that are currently undergoing elevated 
star formation activity (relative to the Hubble time) have typically 
done so for at least a Gyr. It is also clear that the fraction of galaxies 
that show clear signatures of recent starbursts declines rapidly with 
mass. This can either be due to a lower gas fraction in more mas- 
sive galaxies and/or the fact that periods of elevated star formation 
may be shorter in duration in more massive galaxies. 

Figure |22| also shows that the peak of the distribution shifts 
rather little over a large range in mass (log M» < 10.5). The peak 
value is very close to b = 1. In other words, most galaxies with 
log M* < 10.5 are forming stars at a rate that is consistent with 
their past average (ie. at a constant rate). Above few x 10 10 Mq, 
we see a clear change in the shape of the distribution of b 9 with 
a transition towards a low specific SFRs. This coincides well with 
the transition mass defined by Kauffm ann et alji2Q03al) . Also note 
that the second peak showing up at tsfr ~ 10 -12 at high mass is 
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Figure 22. The observed specific star formation as a function of total stellar mass in the mass ranges indicated in the top left hand corner of each panel. Shaded 
areas show recent starbursts, see text for details. Each histogram is normalised to the total number of galaxies in the mass range, given as N in each panel. 
The dashed histogram is the distribution in the log M* 6 (10.0, 10.5] bin and is included for reference. The thick black bar shows the average b value, b 9 , in 
each mass range and the thick red bar shows b v . The latter is not shown in the first panel since we might not be statistically complete there. All objects with 
log SFR/M* ^ — 12 have been placed as a Gaussian around log SFR/M* = —12 as discussed in the text. 



broader than the one at high rsFR because the SFRs are less well 
constrained there. 

Despite this near-constancy of the peak, there is a marked 
change in the shape of the tails. At low masses there is a substantial 
tail towards higher 6-values, ie. a trend towards more bursty star 
formation activity. This changes to a substantial tail towards low 
fa-values at high masses. Many massive galaxies are now forming 
stars at a depressed rate compared to their past average. 



9 THE BIRTHRATE PARAMETER AS A FUNCTION OF 
PHYSICAL PARAMETERS 

Section^focused on the distribution of the SFR density as a func- 
tion of different physical parameters. We will now study the rela- 
tion between the specific SFR and these same quantities. This will 
help us understand which factors control/are controlled by the star 
formation activity in individual galaxies. 

Most previous studies have focused on the trend with mass 
and Hubble type. A key question has been which of these pa- 
rameters correlate better with b. The studies of local spirals by 



iGavazzi & Scodeggid Jl996l) and iBoselli et all feOOlh indicated 
that the driving variable was the stellar mass content. Other stud- 
ies have made more of the correlation with Hubble type (e.g. 
iKennicutt et all 1994) . Here we will improve on these studies with 
a sample that is several orders of magnitude larger than the previous 
studies and with more careful and internally consistent modelling. 
We will not discuss the dependence on local density here. Recent 
studies of the correlation betwe en star formation and density can 
be found in lGomez et ail <2003l) and lBalogh eTaTl Il998ll200ll) . 

Throughout this analysis, we will show results using b 9 and 
b v derived from our total SFR estimates. In Figure l23l we show 
the trend of b with central concentration/Hubble type. The left hand 
panel in Figure l23l contrasts b 9 (solid line) with b v (crosses). The 
solid line shows the median of the unweighted b 9 calculated from 
the right panel using the formalism discussed in Appendix We 
do not show the confidence intervals, but as is evident from the full 
likelihood distributions, they are wide. The crosses show b v in each 
bin in Rgo/R^o- This is calculated using Monte Carlo summation 
and is basically the ratio of the black and red curves in Figure l20l 
In general b v is larger than b 9 because tsfr has substantial tails 
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Figure 23. The specific SFR as a function of concentration. The left panel contrasts b 3 with b v . The continuous line in this plot shows the median of the 
unweighted b 9 (right panel) and the crosses show b v calculated from the data in Figure l20l The right hand panel shows the (log of the) observed likelihood 
distribution of rspR with respect to the concentration parameter, R90/R50, calculated as described in the text. The shading shows the conditional likelihood 
distribution (volume corrected) given a value for i?go / R50 ■ The contributions to the likelihood distributions below the plotted range have been put in the two 
lowest bins in t*sfr- 
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Figure 24. Similar to Figure l23l but this time showing b as a function of the stellar mass. 



towards large values, 
from the median. 



b is therefore dominated by galaxies away 



plotted range in tsfr has been placed in the two lowest bins in 

rsFR- 



The right panel of Figure 21 shows the conditional likeli- 
hood distribution of tsfr given the concentration parameter, C = 
R90/R50, for the local universe. This has been constructed by as- 
suming that P(C) is independent of P(rsFR.) and assuming a 5% 
error on C (this latter assumption is not important for the results). 
Each distribution has been weighted by 1/V ma x- The distribution in 
the right panel then follows from a straight addition of all the in- 
dividual P(C, tsfr) and normalisation of the distribution in each 
bin of C to produce the conditional likelihood distribution. As in 
Figure l22l contributions to the likelihood distributions outside the 



These two panels illustrate the bimodal nature of the distribu- 



a number of papers (e.g 


IStrateva et all200ltlBlanton et all2003ii 


Kauffmann et al. 2003b 


Baldrv et al. 2003). As can be seen, galax- 



ies split into two basic populations: concentrated galaxies with low 
specific star formation rates, and low mass, less concentrated galax- 
ies with high specific star formation rates. The distribution at high 
concentration is broad however because there is also a fairly im- 
portant contribution from concentrated, actively star forming galax- 
ies at very low redshift. When the figure is plotted for the sample 
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Figure 25. Similar to Figure l23l but this time showing b as a function of stellar mass surface density. 
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Figure 26. Similar to Figure l23l but this time showing b as a function of °' 1 M r magnitude. 



directly, ie. without volume weighting, the two regimes are much 
more clearly separated. 

Figure l24l shows, with the same layout, the distribution of b 
as a function of stellar mass. We see that although there is a clear 
decline in b with increasing stellar mass, the trend in b 3 is weak 
over a large range in stellar mass until it suddenly drops at around 
lO 11 h^ o 2 M0. The b v value shows a much smoother trend. This is 
caused by the large spread in tsfr at given mass. If we compare 
Figures 1241 and l23l we see that the Hubble type correlates some- 
what less well with the b parameter than the stellar mass does. 

Figure [25] shows b as a function of stellar mass surface den- 
sity, /i*. T he conn ection bet ween thi s and M* has been discussed 
in detail by Kauffmann et al. 1 2003b). Those authors show that /i„ 
correlates well with M», except at the very highest masses where 
fi» reaches a saturation value. It is therefore not surprising that we 
see a very similar trend with ft, as we saw with M* in Figure l24l 
When we plot the figure without volume weighting it is very simi- 



lar but the low SFR peak becomes more prominent at high fi, than 
at high M* , suggesting that the surface density of stars is more im- 
portant than the stellar mass in determining when star formation is 
turned off. 

Figure [26] again presents a similar view. This time the b pa- 
rameter is plotted as a function of absolute r-band magnitude at 
z = 0.1. The correlation here is fairly good. For galaxies fainter 
than about M T = —20, the b g parameter increases by about 40% 
per magnitude. 

Figure |2*7I shows the relationship between b and the 4000A- 
break. The left panel shows th is to be a fairly tight relation — one 
reason why iKauffmann et alj 12003 al) were so successful in con- 
straining the M/L ratio for galaxies using the 4000A-break and 
the H8a index. We note that this relation does have the potential 
to constrain star formation in high redshift galaxies, as the Balmer 
break can be measured out to considerably higher redshift than Ha. 
From Figure l27l we see that a measurement of D4000 allows one to 
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Figure 28. Similar to Figure l23l but this time showing b as a function of the half-light radius of the galaxies. 



constrain b g to within a factor of three (la) for D4000< 1.6. Al- 
though this is not very precise, it should be weighed against the fact 
that it is very difficult to estimate star formation rates to better than 
within a factor of two with any technique. It is however important 
to caution that the D4000 index intrinsically depend on the SFR in 
the last giga-year rather than the current SFR, as well as the metal- 
licity. It is therefore necessary to assume that the present SFR is 
similar to the average activity in the last Gyr to make recourse to 
the relation in Figure l27l 

We caution that D4000 is measured within the fibre, whereas 
our SFR estimates are for the entire galaxy. If we make the same 
plot using SFR estimated within the fibre, we still recover a tight 
correlation between b 9 and D4000. Moreover, we find that b 9 and 
b v now coincide. It is galaxies that have high D4000 within the 
fibre but significant star formation outside which make b v consid- 
erably higher than b 9 at high D4000 in Figure l27l 



We finish by showing b versus Rso in Figure l28l It is clear that 
the low and high b galaxies co-exist over the entire range in ifeo. 
But there is a general trend for higher b v at the smallest sizes and 
for large galaxies to have typically lower values of b . 

9.1 Comparison with previous studies 

Before we turn to a discussion of these results, it is enlightening 
to compare our results with previous work in more deta il. The 
main c omp arison samples are th ose presented bv iKennicutt et al] 
1 1994 and lGavazzi etai] f2002) in their studies of star formation 
histories in spiral galaxies. They calculate ^-parameters for a range 
in morphological types and in Figure [29] we plot this distribution 
of b versus approximate morphological type for our sample. We 
have taken the relation between RC3 T-type and concentration from 
IShimasaku et alJ 1200 ll) to convert concentration measures into ap- 
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Figure 29. The present to past-average star formation rate for the sample 
plotted against central concentration in the r-band. The top x-axis shows the 
mean T-type implied by the concentration according to the recipe described 
in the text. The crosses show the sample of Kennicutt et al. 1 1994) and the 
boxes that o flGavazzi et al]|2002t) . 

proximate T-types. We fitted a linear relation between T and C 
from their Table 3 which is indic ated on the upper a xis. In the same 
plot we show the resultsjrf|Kermicutt etalJ 1 19941 their Table 4) 
as crosses and those of lGavazzi^HrtTl2002h as squares, with big 
squares corresponding to the median value (BCGs are not shown). 
The data we show for our study are the unweighted b 9 values which 
should be appropriate for comparison to local studies. 

There is clearly a considerable uncertainty in the x-axis, but 
the general agreement in trends and turn-over between the differ- 
ent samples indi cates that the conv ersion between C and T-type 
derived by IShimasaku et alJ l200l[) seems to be fairly reliable. It 
is also clear that the values from lGavazzi et all J2002) are consid- 
erably lower than ours. This may be due in part to differences in 
galaxy density in the two samples, but is more likely due to differ- 
ences in the method used to determine the stellar mass. Gavazzi et 
al use an average dark matter mass to stellar mass ratio to derive 
stellar masses. We have chosen not to attempt to adjust for dif- 
ferences in methodology, as our main interest is in comparing the 
overall trend. 



10 DISCUSSION 

The preceding sections have provided a detailed inventory of the 
star formation activity in the local universe. One possibly surprising 
result is that the dominant contribution to the SFR density comes 
from galaxies with on-going star formation in their inner few kilo- 
parsecs (the SF and low S/N SF classes). These galaxies contribute 
~ 72% of the total SFR density, whereas galaxies with signs of 
AGN (Composites and AGN) contribute about 15% of the total. 
The remaining 12% of psfr is located in galaxies we are unable 
to classify based on their fibre spectra. These are almost certainly 
galaxies with substantial bulges, in which stars are forming in the 
outer disk. A thorough examination of this question has to be post- 
poned to a different article, but it is clear that the relative impor- 
tance of star formation in the outer disk is a function of the mass or 
luminosity of the galaxy. When we look at narrow redshift ranges 
we find that the median aperture corrections increase by a factor 
of 2 from stellar masses around 1O 9 M to M» ~ 10 n M Q . The 
natural question is how this relates to star formation in bulges as 



compared to disks, but this will have to await a study of the re- 
solved properties of our galaxies. 

We found that galaxies in the SF class have SFR profiles that 
follow the r-band light. For these galaxies, it is possible, and accu- 
rat e, to remove ap erture bias by simple scaling with the r-band flux 
(cf. lHopkins et all2003l) . 

We have, for the first time, been able to calibrate the conver- 
sion factor from Ha to SFR as a function of mass and we show that 
the standard Kennicutt 1 1998) conversion factor is a very good aver- 
age correction for most galaxies in the local Universe. By lucky co- 
incidence, trends in the conversion factor and in the intrinsic Case 
B ratio of Ha to H/3 tend to cancel each other out. As a result, it 
will be sufficient to take a fixed Case B ratio and a fixed 

rf Ha for 

most studies, even though neither of the two assumptions is correct. 

One of the main questions we want to address is what excites 
large-scale star formation in galaxies. Seen from this angle the dif- 
ferent parameters we have studied fall into two categories. First 
there is the incidental set - those parameters which are controlled 
by star formation activity rather than the other way around. Among 
these are D4000 and colour. Of more interest are those parameters 
that may have some direct effect on star formation such as M, , /i* , 
Hubble type and size. 

We have found that the stellar mass correlates well with 
the S FR, a result found previousl y for a much smaller sam- 
ple by iGavazzi & Scodeggid ll996ll . This is quite likely because 
even though the gas fractions of galaxies decline with increas- 
ing mass, th e total gas masses still increase as a function of mass 
jBoselli et all200ll) . 

Because of this overall correlation, the SFR density distribu- 
tion functions are rather simil ar to the mass de nsity distribution 
functions discussed by Kauff in arm et alJ l2003al) . We find that the 
majority of star formation takes place in high surface brightness 
disk-dominated galaxies spanning a range of concentrations. These 
span a broad range of masses, peaking for galaxies slightly less 
massive than 10 11 Mq and with half-light radii around 3 kpc. The 
star forming gas has typically solar or slightly super-solar metallic- 
ity. 

Our most striking result is contained in Figure ri"8l which 
shows that the majority of star formation in the local universe takes 
place in massive galaxies. This is a robust prediction of hierar- 
chical mo dels and th e shap e of our distribution matches that of 
ISpringel & Hernuuistl 120031) (although these authors show /Osfr 
versus the halo mass). This clearly warrants a more in-depth com- 
parison and we will follow this up in a forthcoming paper. 

Another interesting question is whether star formation in the 
local universe is predominantly bursty or relatively quiescent. As 
we saw in Figure 1241 the answer depends strongly on the mass 
of the galaxy. It is safe to say, however, that the vast majority of 
star formation in the local universe takes place in galaxies that are 
not undergoing extreme starbursts. Quantitatively we find that 28% 
of Psfr comes from galaxies with SFR/ (SFR) > 1 and 50% at 
SFR/(SFR) > 0.54, but only 3% of psfr comes from galaxies 
with SFR/ (SFR) > 10. The traditional definition of star bursts is 
however b ^ 2-3 and with that definition around 20% of the local 
SFR density takes place in star bursts. We should caution that our 
method of inferring the SFR from optical emission lines is likely to 
miss part of the SFR occurring in extremely dusty FIR-dominated 
galaxies. Given that such systems are very rare at low redshifts (e.g. 
ISanders & Mirabell 996). this does not significantly affect our re- 
sults. 

Although IR-luminous galaxies are rare in the local universe, 
most local star formation occurs in galaxies with substantial opti- 
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cal attenuation in their central parts. It is important to realise that 
this quantity might suffer from aperture biases so the galaxy av- 
eraged attenuation could be lower. The relationship between dust 
attenuation and SFR activity has been discussed by many au thors. 
The most recent discussion is given by Hopkins et al. (2003) for a 
subsample of our SDSS data. Our results are in good agreement 
with theirs. Our main result is that the SFR-weighted dust atten- 
uation at Ha is Ahc ~ 1.0 ± 0.5 magnitudes, with a good cor- 
relation between dust attenuation and SFR. The exact relationship 
between dust and star formation remains unclear. We see evidence 
for a strong correlation between stellar mass and dust attenuation, 
presumably reflecting a correlation between metallicity and dust 
content. The correlation between SFR and dust may simply reflect 
a more fundamental correlation between dust and metallicity. This 
question clearly merits further investigation. Another consequence 
of the moderately strong dust attenuation present in most galaxies 
is that the colours of the galaxies that dominate the SFR density 
have intermediate values — this is why we needed a second colour 
to get a good constraint on star formation activity in section|5| 

To isolate the influence of other factors that may regulate 
the star formation, it is clearly necessary to take out the domi- 
nant trend with mass. One should probably normalise by the mass 
of the gas reservoir, but as this information is not available we 
have n orm alised by stellar mass . Previous studies bv lGavazzi et alJ 
(2002) and Bose lli et all 1200 ll) have shown that SFR/M* corre- 
lates with mass. We confirm this trend. The median value of tsfr is 
fairly constant over a wide range at low masses and then decreases 
smoothly at masses greater than M, ~ 1O 1O M0. The simplest 
interpretation is that low mass galaxies form stars at an approxi- 
mately constant rate, whereas the star formation history is increas- 
ingly peaked towards earlier times for more massive galaxies. 

These trends have arguably been seen before. They agree with 
the "downsizing" scenario of lCowie etai] ll99tj) . What has not 
been apparent up to now is the fact that the extremes of the dis- 
tribution change noticeably. As we go up in mass, there are fewer 
and fewer galaxies with very high tsfr and a corresponding tail 
towards low tsfr starts to appear. 

It is clear from the previous section that no single factor 
stands out as controlling t"sfr. This is not entirely surprising since 
the properties we have used are simple and somewhat limited. It 
might turn out that more detailed properties correlate better with 
b, for example lopsidedness iRudnick etaflj gOOO). local density 
iGomez et al1l2003l) or merging state (e.g. 
We will study these relations further in future work. 

We mentioned previously that the rgpR value for the sam- 
ple as a whole provides a strong constraint on the star formation 
histor y of the unive rse. This has been discussed in some detail by 
iBoselli et al] 1200 ll) . but that study did not use a SFH that is com- 
patible with look-back studies - it assumed an exponential star for- 
mation history for all galaxies. Our b v value with our cosmology 
corresponds to an effective r = 7^5 sGyr (including systematic 
errors) if one considers the whole universe to have an exponential 
SFH. 

We will however adopt th e para meterisation of the SFH of the 
universe used by Ba ldrv etalj j2002). In general form this reads 



SFRoc 



(l + z) a for ze[zd,z c 
{1 + zf for z<z d 
for 2 > z c 



(13) 



b v =0.408±0.090 ; 
H = 70 km/s/Mpcj 



s Or 




Figure 30. The constraint on a and (3 (see Equation ll31 from the b v value 
calculated for our sample. The solid line shows the locus of b v = 0.400 
whereas the dashed line shows the 68% confidence limits from the likeli- 
hood distributions and the dotted the locus of our conservative systematic 
eiTor estimates. 



showed that the local "cosmic spectrum" is well fit by a model with 
(3 = 3 and f3 = 0. 

To calculate b v for the SFH defined by Equation I13i we sim- 
ply integrate over time and define 



(SFR) = 



to 



SFR(t)dt, 



(14) 



with to being the age of the universe at z = 0.1. This, together 
with equation 1 1 3i . then gives the present to past-average SFR for 
the model 



= t 



SFR(tp) 
$1° S¥R{t)dt 



(15) 



where we will adopt Z d = 1, z c = 5 to be consistent with 
Baldr v et alJ 120021) and iGlazebrook et alJ 120031) . Those authors 



It is interesting to note that because of the way we measure M* 
and the SFR, there is a dependence on the Hubble parameter in our 
b value calculations. That dependence cancels out in equation 1131 . 
To constrain a and f3 we thus need a prior on Ho. We will take the 
simple approach and choose h — 0.7. We use the constraint on b v 
given in equation J 121 . 

Figure l30l shows the resulting constraints on a and (3. Since 
the b v value only constrains the z = 0.1 value of the SFR and 
the integral under the curve, there is a strong degeneracy even with 
a fixed functional form. This degeneracy cannot be lifted without 
independent constraints, but it is still of considerable interest. Most 
noticeably it is clear that any really strong evolution, ft > 4, at 
z < 1 is excluded by the current data. 

The dotted lines in Figures l30l show the effect of systematic 
uncertainties on these constraints (for simplicity we have sym- 
metrised these). The dashed lines show the purely statistical errors 
(lcr). It is clear that if we could remove all systematic uncertain- 
ties, the method would give a very powerful constraint on the star 
formation history of the universe. It is therefore worth asking what 
is required to further reduce the systematic uncertainties. 

A thorough test of our aperture corrections using resolved 
spectroscopy would substantially reduce the systematic uncertainty 
in our SFR estimates. To reduce uncertainties due to evolution, one 
could limit oneself to a narrow range in redshift, but that would 
mean a corresponding increase in the uncertainty in extrapolating 
to faint galaxies. Alternatively, it might be possible to directly con- 
strain the evolution inside the sample, but that is a rather delicate 
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operation due to the complex aperture effects. We have postponed 
this to a later paper. 

Systematic uncertainties notwithstanding, the results pre- 
sented here provide remarkably good constraints on the cosmic 
star formation history using only data at z < 0.22. It is clear that 
a similar study carried out at higher redshift will be of great in- 
terest and provide a very interesting comple ment to recent stud- 
ies of the mass-assembly of the universe I Brinchmann & Ellis 
2000HDrorv et alj200ll:lbickinson et all2003tlFontana et all2003t 
Rudnick et all2003l) . 

In general it is our firm belief that future studies of high red- 
shift data should try as far as possible to study the evolution of the 
full distribution functions in question rather than summarising the 
distributions to a single value. As we have seen, important changes 
might occur in the tails of the distributions with little influence on 
the median value. These physically interesting details can only be 
extracted through the use of distribution functions. It has been stan- 
dard to study luminosity functions. With the techniques introduced 
in the present paper it is now possible to extend the analysis to a 
wide variety of distribution functions. This has been possible due 
to the unprecedented dataset provided by the SDSS survey. 
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APPENDIX A: MANIPULATION OF LIKELIHOOD 
DISTRIBUTIONS 

Throughout this paper we use the full likelihoods of all parame- 
ters. This means that in many situations it is necessary to combine 
likelihood distributions. For the benefit of the reader we summarise 
some of the most important results here. We will use standard no- 
tation and write e.g. fz(z) for the likelihood of a value z in the 
likelihood distribution Z. Thus we represent the addition of two 
likelihood distributions as Z = X + Y. This sum is discussed in 
most introductory statistics textbooks and the well know result is 
that 

indcp. 



fz(z) 



E 



fxy(xi,z-Xi) ™= P ' fx {xijfv (z-Xj), (Al) 



where the latter equality is valid in the case of independent vari- 
ables. 

In the present work we do not often use the straight addition 
of distributions. The general case is not normally given in statistics 
textbooks. It is however easy to derive. We will focus our attention 
on the cumulative distribution function (CDF) F(z ^ z). In the 
continuous case this is given as 



F(z «S z) 



fxr(x,y)dx dy, 



(A2) 



where 1Z is the region where the constraint z ^ z is fulfilled. If 
we write the general composition of two likelihood distributions 
as Z — g(X,Y) then the constraint z ^ z translates into z ^ 
g(x, y). For well-behaved distributions we can invert this to write 
y ^ h(x, z). In general we can then write 



F(z ^ z) 



h(x,z) 



fxv(x,y)dx dy. 



(A3) 



oo J — oo 



A simple change of variable presents itself and by setting y 
h(x, u) we get 



F(z z) 



OO fZ 



fxY(x,h(x,u))J(h)dxdu, (A4) 



with J = dh/du being the Jacobian of the variable transformation. 
By taking the derivative of this we can get the likelihood distribu- 
tion fz(z): 



fz(z) 



dF 

dz 



fxY (x,h(x,z)) \ J{h)\ dx, 



(A5) 



which can be evaluated for any given g(X, Y). The extension to 
discrete distribution functions is straightforward (see below). 

As an example we can take the addition of two likelihood dis- 
tributions for the SFR which we need for the aperture corrections. 
Here we have 



Z = log 10 (10 x + 10 y ) 



(A6) 



since the likelihood distributions are in log space. Thus calculating 
the Jacobian and writing it as a discrete distribution 



fz(zj) = 

^/xM.Miog 10 (io 2 



10**)) 



10*3 



10 2 



10* 



(A7) 



This formalism extends fairly straightforwardly to the iterated 
composition of many (independent) likelihood distributions 



fz(z) = 



fx 1 {xi)fx 2 [h 2 (xi,x 2 )] ■ 



fx N [/ijv(a;jv-i, xn) 



n 



dhi 
dxi 



d N Xi 



(A8) 



but this is not a practical approach for arbitrary likelihood distri- 
butions due to combinatorial explosion (the sum/integral has to be 
done over all combinations of variables that combine to a partic- 
ular value of z). Instead it is easier in this situation to calculate 
the combined likelihood distribution in an approximate manner by 
carrying out a Monte Carlo sampling of the distributions. In this 
case we first form the CDF, F(x), for each galaxy. We then draw, 
for each galaxy, a random number, p, between and 1 and solve 
F(x) = p numerically using linear interpolation. 

We use the latter approach for instance when evaluating the 
star formation density of the local universe. The main disadvantage 
of a Monte Carlo approach is that one needs to carry out a large 
number of repetitions to ensure a sufficient sampling of the final 
likelihood distribution and to estimate the confidence intervals with 
good accuracy. 

Before continuing we would like to emphasise a technical is- 
sue which is of considerable importance for some combinations 
of likelihood distributions: Ignoring the likelihood distribution of a 
particular variable (in effect setting its error to zero) will in general 
bias the estimate of any combination of this variable with any other 
unless they are both Gaussians. A specific example relevant for our 
work is the ratio of two variables whose likelihood distributions are 
kept in log space. 

In particular the calculation of rsFR in section|8|falls into this 
category. Here we have Z = 10 y /10 x where X is the likelihood 
distribution of the stellar mass and Y that of the SFR. If we now for 
illustrative purposes assume that these are Gaussian in log space: 
log SFR ~ N (log SFRq, (t S fr) and likewise for log M* then a 
straightforward calculation gives us that the mean of Z 



(z) 



zf z (z)dz, 



(A9) 



oo </ — oo 



can be written 
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M = e logSFR/M, 0+ ( CT ^ +t r| pR )/21 n 10^ (M()) 

Clearly setting <tm« = would bias the average value for 
the specific SFR in this situation. The exact amount depends on 
the details of the likelihood distribution and the bias could vanish 
entirely. In practice (as mentioned in section [8} we find that for 
the calculation of the specific SFR, ignoring the uncertainty in A/„ 
would bias our results about 15% high. 

Finally we would like to mention the various ways one can 
summarise a distributio n and what we typically use (see also 
iKauffmann et alj f2003a). The average is the simplest as it is cal- 
culated as 

(x) =^Xifx{xi), (All) 

i 

since fx is normalised to a total of one. The mode is equally simple 
imodc = Xi such that maxfx(xj) = fx(xi). (A12) 
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In the event of a tie we use the smallest value satisfying the cri- 
terion. These two, but in particular the mode are affected by the 
binning chosen for the likelihood distributions. 

More commonly used in the present paper are the percentiles 
of the distributions, defined in the standard way as 

F x (x p )=p, (A13) 

where Fx is the CDF, p is the percentile — p — 0.5 gives the 
median, p — 0.16 and p = 0.84 give the upper and lower "lcr" 
limits chosen and x v is the value to solve for. 

To avoid severe binning effects (in particular for the extremes 
of the distributions) we calculate these percentiles by solving equa- 
tion IA13I using linear interpolation between grid points. In this 
situation we interpolate the CDF and this is a robust procedure. 
For the average a straight application of equation <A1 H would re- 
quire interpolation of the probability density function (PDF) which 
might depend sensitively on the interpolation scheme. Instead we 
calculate the average by interpolating the CDF and drawing 10 4 re- 
alisations of this for each galaxy and calculating the average from 
this. We have experimented with different numbers of realisations 
and interpolation schemes and found that the estimate of the av- 
erage so calculated is robust to these changes within the accuracy 
needed for our project. 



use this to determine the [O IIl]5007-flux from the PNe p opula- 
tion in a given galaxy. Using the PNe LF in Ciardullo et al. 1 2002) 
we find that the maximum contribution to the total [O IIl]5007-flux 
of the sample by PNe is of the order of 1.5%. This is in general 
totally negligible in comparison with other sources of uncertainty, 
but it is worth noting that we do predict that among galaxies with 
very weak emission lines, S/N < 5 in [O III]5007, around 20% 
of the galaxies could have PNe contributing more than 20% of the 
[O IIl]5007-flux thereby complicating studies of galaxies with very 
weak emission lines. Since the [O III]5007/Hq ratio in PNe is in 
general considerably larger than for star forming galaxies, the effect 
on Ha and hence the star formation estimates, is therefore entirely 
negligible except at S/N < 1 in Ha so has no effect whatsoever 
for our results here. 

A significant amount, typically 30-50%, of the emission 
line flux in galaxies comes from the diffuse i onized g as (DIG 
i Lehnert & Heckmarll994l : IWang et alll997l:lHoopes & Walterbosl 
120031) . This is implicitly taken into account by the CL01 models be- 
cause as CL01 show the combination of the (low-ionization) DIG 
and the (higher ionization) Hll-regions is typically represented by 
a lower effective ionization parameter (see also Mathis 2000). Al- 
though we_do_not explicitly incorporate leaky Hll-regions as done 
by e.g. Hoopes & Walterbos 1 2003), we can produce similar line- 
ratios by changing £. Thus the models do not need any extra com- 
ponent to take into account DIG and as long as the DIG is heated by 
OB stars as seems necessary, its presence should not bias our SFR 
estimates. The interpretation of the fitted ionization parameter and 
dust-to-metal ratio is more complex and we will discuss this issue 
in more detail in an upcoming paper. 



APPENDIX B: MINOR CONTRIBUTIONS TO THE 
EMISSION LINE SPECTRA 

We will here discuss, briefly, the contribution of supernova rem- 
nants (SNRs) and planetary nebulae (PNe) to the overall emission 
line flux of galaxies. The SNRs are interesting in their own right, 
but due to their short lifetime and the fact that they co-exist with 
star forming regions they are unlikely to affect the line-radiation 
significantly . This is borne out in studies of resolved Hll-regions in 
the LMC JOev et aljE oOO) which show that SNRs only affect the 
emission line spectra at a fairly low level. 

As discussed by CL01, the contribution of PNe to the line 
radiation can be ignored in galaxies with active star formation, 
however in the current work we have galaxies with very low Ha- 
luminosities and it is therefore necessary to ask if these fluxes 
can come from PNe. It is straightforward to assess this as the 
PNe [O III15007 lum inosity function seems to be universal (e.g. 
ICiardullo et ai]l200i and references therein). We can therefore 



